SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT:

(A) NAME: Ludwig Institute for Cancer Research

(B) STREET: St. Mary's Hospital Medical School, Norfolk Place

(C) CITY: Paddington, London

(E) COUNTRY: United Kingdom

(F) POSTAL CODE (ZIP): W2 1PG

(ii) TITLE OF INVENTION: PROTEINS HAVING SERINE/THREONINE KINASE
DOMAINS, CORRESPONDING NUCLEIC ACID MOLECULES, AND THEIR
USE

(iii) NUMBER OF SEQUENCES: 29

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 283..1791



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AGGAAACCCT TTATTAGCAC GGAGTCCTCG AGCTCCGCCA GGCAGGAAGA CGCTCCAATA 60
ACAAACATTT TTGCTCCAGC CCCCATCCCA CTCCCGCGAG GCTCCCGCCC CAGCTGCGCC 120
GAGCCAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC CCCGGACCCC AGCCCCCCCT 180
CCACCGCTGC CGGTGCAACT CCGGCCCCGC GGTGGAGGCG AGGTGGCCCC GGTCCGCCCA 240
AGCCTAGCGC CCCGCCACCC GCAGAGCGGC CCGAGAGGGA CC ATG ACC TTC CGC 294

Met Thr Leu Gly
1

TCC CCC AGC AAA CCC CTT CTG ATC CTG CTC ATC CCC TTC CTC ACC CAC 342
Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala Leu Val Thr Gln
5 10 15 20

CCA GAC CCT CTC AAO CCC TCT CCC CCC CCC CTC CTC ACC TCC ACC TCT 390
Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val Thr Cys Thr Cys
25 30 35

CAG ACC CCA CAT TCC AAG CCC CCT ACC TCC CCC CCC CCC TCC TCC ACA 438
Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly Ala Trp Cys Thr
40 45 50

GTA CTG CTG CTC CCC CAG CAG GCC AGC CAC CCC CAC GAA CAT CCC CCC 486
Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln Glu His Arg Gly
55 60 65

TCC CGC AAC TTC CAC AGC CAC CTC TCC ACC CCC CCC CCC ACC CAC TTC 534
Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg Pro Thr Glu Phe
70 75 80

CTC AAC CAC TAC TCC TCC CAC AGC CAC CTC TCC AAC CAC AAC CTC TCC 582
Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn His Asn Val Ser
85 90 95 100

CTC CTG CTG CAG CCC ACC CAA CCT CCT TCC CAG CAC CCC CCA ACA CAT 630
Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln Pro Gly Thr Asp
105 110 115

CGC CAG CTC CCC CTC ATC CTG CCC CCC CTC CTC CCC TTC CTC CCC CTC 678
Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala Leu Leu Ala Leu
120 125 130

CTC GCC CTG CCT CTC CTC CCC CTC TCC CAT CTC CCA CCC AGC CAC CAC 726
Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg Arg Arg Gln Glu
135 140 145

AAC CAG CCT CCC CTC CAC AGC GAG CTC CCA CAC TCC AGT CTC ATC CTG 774
Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu Ile Leu
150 155 160

AAA CCA TCT CAG CAG CCC CAC ACC ATC TTC CGC CAC CTC CTC CAC ACT 822
Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser
165 170 175 180

GAC TCC ACC ACA GCC AGT CCC TCA GGG CTC CCC TTC CTG CTC CAG AGC 870
Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gln Arg
185 190 195

ACA CTG CCA CCC CAG CTT CCC TTC CTG CAG TCT CTG CCA AAA GCC CCC 918
Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg
200 205 210

TAT GCC GAA CTG TCC CCG GCC TTC TGG CAC GGT GAG AGT CTC CCC CTC 966
Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser Val Ala Val
215 220 225
AAG ATC TTC TCC TOO AGO OAT OAA CAG TCC TOO TTC CCC 6A0 ACT OAO 1014
Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg Glu Thr Glu
230 235 240

ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062
Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu Gly Phe Ile
245 250 255 260

GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTC TGC CTC ATC 1110
Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu Trp Leu Ile
265 270 275

ACG CAC TAG CAC GAG CAC GCC TCC CTC TAC CAC TTT CTC CAG AGA CAG 1158
Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gln Arg Gln
280 285 290

ACG CTC CAG CCC CAT CTC CCT CTC AGO CTA CCT CTC TCC COG CCA TCC 1206
Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys
295 300 305

GCC CTC GCC CAC CTC CAC GTG CAG ATC TTC CCT ACA CAC CCC AAA CCA 1254
Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln Gly Lys Pro
310 315 320

CCC ATT CCC CAC CCC GAC TTC AAG AGC CCC AAT CTC CTC CTC AAC ACC 1302
Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser
325 330 335 340

AAC CTC CAG TCT TCC ATC CCC CAC CTC CCC CTC CCT GTG ATG CAC TCA 1350
Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Met His Ser
345 350 355

CAG CCC ACC GAT TAC CTC CAC ATC CCC AAC AAC CCC AGA CTC CCC ACC 1398
Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro Arg Val Gly Thr
360 365 370

AAG CCC TAC ATG CCA CCC GAC CTC CTC GAC CAG CAC ATC CCC ACC CAC 1446
Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln Ile Arg Thr Asp
375 380 385

TCC TTT CAG TCC TAC AAG TCC ACT CAC ATC TCC CCC TTT CCC CTC CTC 1494
Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe Gly Leu Val
390 395 400

CTC TCC GAG ATT CCC CCC CCC ACC ATC GTG AAT CCC ATC CTC GAG CAC 1542
Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly Ile Val Glu Asp
405 410 415 420

TAT AGA CCA CCC TTC TAT CAT CTC CTC CCC AAT CAC CCC ACC TTT CAC 1590
Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu
425 430 435

CAC ATC AAG AAG GTG CTC TCT CTC CAT CAC CAC ACC CCC ACC ATC CCT 1638
Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro Thr Ile Pro
440 445 450

AAC CGC CTC CCT GCA GAC CCG CTC CTC TCA CCC CTA CCT CAC ATC ATC 1686
Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gln Met Met
455 460 465
CCC GAG TGC TOG TXC CCA AAC CCC TCT CCC CCA CTC ACC CCG CTG CGG 1734
Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg
470 475 480

ATC AAG AAG ACA CTA CAA AAA ATT AGC AAC ACT CCA GAG AAG CCT AAA 1782
Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro Glu Lys Pro Lys
485 490 495 500

CTG ATT CAA TAGCCCAGGA CCACCTCATT CCTTTCTGCC TGCAGGCGGC 1831
Val Ile Gln

TGGGGGGGTG GGGGCCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT CAGTGTGGTG 1891 

TGTGCTGGGG ATGGGCAGCT GCGCCTOCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951 

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 1984 
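
The CDS location given for SEQ ID NO: 1 (283..1791) can be checked against the stated protein length for SEQ ID NO: 2 (503 amino acids) with simple arithmetic. The following is a minimal Python sketch of that check; the helper name `cds_codon_count` is hypothetical and not part of the listing, and it assumes 1-based inclusive coordinates with the stop codon lying outside the stated range, as appears to be the case here.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of complete codons in a CDS.

    Assumes 1-based, inclusive coordinates (as in the FEATURE tables)
    and that the range does not include the stop codon.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS {start}..{end} is not a whole number of codons")
    return length // 3


# SEQ ID NO: 1, CDS at 283..1791
print(cds_codon_count(283, 1791))  # 503
```

The same check applied to the CDS locations of the other nucleotide sequences in this listing should reproduce the lengths stated for their protein sequences.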

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 503 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala
1 5 10 15

Leu Val Thr Gln Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val
20 25 30

Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly
35 40 45

Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln
50 55 60

Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg
65 70 75 80

Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn
85 90 95

His Asn Val Ser Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln
100 105 110

Pro Gly Thr Asp Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala
115 120 125

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg
130 135 140

Arg Arg Gln Glu Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser
145 150 155 160

Ser Leu Ile Leu Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp
165 170 175

Leu Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe
180 185 190

Leu Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val
195 200 205

Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu
210 215 220

Ser Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe
225 230 235 240

Arg Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile
245 250 255

Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln
260 265 270

Leu Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe
275 280 285

Leu Gln Arg Gln Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val
290 295 300

Ser Ala Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr
305 310 315 320

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val
325 330 335

Leu Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala
340 345 350

Val Met His Ser Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro
355 360 365

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln
370 375 380

Ile Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala
385 390 395 400

Phe Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly
405 410 415

Ile Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp
420 425 430

Pro Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr
435 440 445

Pro Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu
450 455 460

Ala Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu
465 470 475 480

Thr Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro
485 490 495

Glu Lys Pro Lys Val Ile Gln
500



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2724 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 104..1630



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CTCCGACTAC CCCAGTGACC AGACTCAGAG AAGCTCTCAA CGAGGGCACG CGGCTTGAAG 60 

CACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAACTTCT ACA ATG GTA GAT GGA 115 

Met Val Asp Gly 
1 



GTG ATG ATT CTT CCT GTG CTT ATC ATG ATT GCT CTC CCC TCC CCT ACT 163 
Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu Pro Ser Pro Ser
5 10 15 20

ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATG TGT GTG 211
Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu Tyr Met Cys Val
25 30 35

TGT GAA CGT CTC TCC TCC GOT AAT GAG GAC CAC TGT GAA GGC CAG GAG 259
Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys Glu Gly Gln Gln
40 45 50

TCC TTT TCC TCA CTC ACC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 307
Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His Val Tyr Gln Lys
55 60 65

GGC TCC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 355
Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr Cys Lys Thr Pro
70 75 80 



SUBSTITUTE SHEET 



WO 94/11502 

PCT/GB93/02367 



403 




S~ 1°° S CT S? 0 601 CTC 6X6 100 «* CCC CAC TOO WT AM 

Pro S.r Pro Cly Oln Al. V.l Clu Cy. Cy. Gin Oly A.p Trp cyl 

90 »5 100 



AGO AAC ATC ACG CCC CAG CTG CCC ACT AAA GGA AAA TCC TTC CCT ee» », 
Axg A.n XI. Xhr Al. Gin L.u Pro Thr Ly. Glf iji S S cti *" 
105 110 us 

ACA CAG AAT TTC CAC TTC CAG CTT CGC CTC ATT ATT CTC rer erm r-iv 
Thr Gin A.n Ph. Hi. L.u Clu V.l Cly *S Zte 25 JS S2 52 
120 130 



»r- «:? A. . * iA tcc CTG CTG CCA CTT OCT CTC CCA AAA 
Ph. Al* y.l Cy. Leu L.u Al. Cy. L.u L.u Cly V.l Al. 55 Arg X. 

; AGC CGC AAC CAA GAA CGC wtc AAT CCC CCA CAC CTG CAG TAT 

Ph. Ly. Arg Arg A.n Cln Clu Arg Uu A.n Pro Arg A.p V™ Clu Tyr 



499 



547 



595 



643 



691 



739 



787 



52 !S til SI ?™ «* ~5 F» F° 5?A CTT CCT CTC CCA AAA 

145 

TTT AAA AGC CGC AAC CAA CAA CCC CTC AAT CCC CCA CAC CTC CAO TAT 

Arg 
160 

CGC ACT ATC CAA CCC CTC ATC ACC ACC AAT CTT CCA CAC AGC ACT TTX 
Cly Thr II. Clu Cly l*u II. Thr Thr A.n 55 «J £p J£ £ £} 

170 175 180 

CCA GAT TTA TTC CAT CAT TCC TGT ACA TCA CCA ACT CGC TCT CCT CTT 
Al. A.p L.u L.u A.p Hi. S.r Cy. Thr S.r Cly S.r Cly ££ cly 21 
185 190 195 

CCT TTT CTG GTA CAA AGA ACA CTG CCT CCC CAC ATT ACA CTC TTT. e»r 
Pro Ph. L.u Vjl Cln Arg Thr V.l Alt Arg oK-l" S 2! 2! 8S 

TCT CTC COG AAA GCC ACC TAT CCT CAG CTC TCC AGC CGC ACC TCC CAA 
Cy. V.1 j}y Ly. Cly Arg Tyr Cly Clu V.l Trp Arl Oly J« J2 
215 220 225 

S? 6 2** *** S 1 ? C ? C CTC A* 0 ATC TTC TCC TCC CCT CAT CAG AA6 TCA B3S 
Cly Clu A.n V.l Al. V.l Ly. XI. Ph . S.r S.r Arg A.p cTu ij. IS " 5 
*<* u 235 240 

TGC TTC AGC CAA ACC CAA TTC TAC AAC ACT CTC ATC CTC AGC CAT CAA 
Trp Ph. Arg Clu Thr Clu L.u Tyr A.n Thr V.l mS Su Arg SI !!£ 
245 250 255 260 

AAT ATC TTA CGT TTC ATT CCT TCA CAC ATC ACA TCA AGA CAC TCC act 
A.n 21. Leu Cly Ph. II. Al. S.r A.p M.t t£ JS ArJ SS IS sir 
265 270 275 

ACC CAG CTG TGC TTA ATT ACA CAT TAT CAT CAA ATC CCA TCC TTC TAC 
Thr Cln L.u Trp L.u II. Thr Hi. Tyr HI. Clu M.t Cly sir Su Tyr 
280 285 290 

CAC TAT CTT CAG CTT ACT ACT CTG CAT ACA GTT AGC TGC CTT CCA ATA 
A.p Tyr L.u Cln L.u Thr Thr L.u A.p Thr V.l CyS 2u Si Ili 

295 300 305 

CTC CTC TCC ATA CCT ACT CCT CTT CCA CAT TTC CAC ATA CXC xn «r-~ 
V.l Leu S.r II. Al. S.r Cly Lau Al. Bll SS Jli §K }g Jg 
310 315 320 
CGC XCC CAA 6CC AAA CCA OCC ATT OCC CAT COX CAT TTA MS AGC AAA 1123
Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys
325 330 335 340

AAT ATT CTO CTT AAO AAO AAT CCA CAO TCT TOC ATA CCA CAT TTG CCC 1171
Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile Ala Asp Leu Gly
345 350 355

CTG CCA CTC ATC CAT TCC CAO AGC ACC AAT CAO CTT CAT CTC CGC AAC 1219
Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu Asp Val Gly Asn
360 365 370

AAT CCC CGI CTC CGC ACC AAC CGC TAC ATC CCC CCC CAA CTT CTA CAT 1267
Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp
375 380 385

CAA ACC ATC CAG CTC CAT TCT TTC CAT TCT TAT AAA ACC CTC CAT ATT 1315
Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys Arg Val Asp Ile
390 395 400

TCC CCC TTT CGA CTT CTT TTC TGC CAA CTG CCC ACC CCC ATC CTO ACC 1363
Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg Met Val Ser
405 410 415 420

AAT CCT ATA CTG GAG CAT TAC AAC CCA CCG TTC TAC CAT CTC CTT CCC 1411
Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr Asp Val Val Pro
425 430 435

AAT CAC CCA AGT TTT CAA CAT ATC AGC AAG CTA CTC TCT CTG CAT CAA 1459
Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys Val Asp Gln
440 445 450

CAA AGG CCA AAC ATA CCC AAC AGA TGC TTC TCA CAC CCC ACA TTA ACC 1507
Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp Pro Thr Leu Thr
455 460 465

TCT CTG CCC AAG CTA ATC AAA GAA TCC TGC TAT CAA AAT CCA TCC CCA 1555
Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln Asn Pro Ser Ala
470 475 480

ACA CTC ACA GCA CTG CCT ATC AAA AAG ACT TTG ACC AAA ATT CAT AAT 1603
Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr Lys Ile Asp Asn
485 490 495 500

TCC CTC GAC AAA TTG AAA ACT CAC TGT TGACATTTTC ATACTCTCAA 1650
Ser Leu Asp Lys Leu Lys Thr Asp Cys
505

GAAGGAAGAT TTCACGTTCT TCTCATTGTC CAGCTGGGAC CTAATCCTCC CCTCACTCGT 1710
TGTCAGAATG CAATCCATCT CTCTCCCTCC CCAAATCCCT CCTTTCACAA CCCACACCTC 1770
CTACCCAGCC ATGTGTTGCC CACACATCAA AACCACCCTA ACCTCCCTCC ATCACTGTCA 1830
ACTCCCCATT TCACCAACTC TTCACACTCC AGAGACTAAT CTTCCACACA CACTCTTCCA 1890
AACCTAGGGA CTGGAGCAAC ACACACAAAT CCTAAAAGA0 ATCTCCCCAT TAACTCAOTC 1950
CCTTTCCATA CCTTTCACAA CTCTCCTACA CACTCCCCAC CCCAAACTCA ACCACCTCCT 2010
2010 
CATTCCTTAC
GCCTACTCCA 
TGTCAGACTT 

aattctttat 
aactgctttc 

ATGTTTTTAA 
TTTAAGTGCT 
ACGTATTTAO 
TTTCAGTAGA 
ATTACGTGCA 
TATTTAGTAG 



TCAGCAATAT 
TTGCACTGTT 
CTGGTCTGTC 
TCCTGCATTT 
ACACAACTTT 
TGCATATGTT 
CACTATACTC 
TCACATTTCT 
CCATTACCCA 
ATTTTAGTCC 
TTTAAACTCT 
TTATTTGTAT 



TGCCTGTCCT 
ACTCTTAATT 
TTTGCATAAT 
TACACATGTG 
GCAAATTATT 
AAAGCTTATT 
TAAAATGGAC 
ATCTCTGTAG 
CGTGACAGCA 
TGAACGCTAC 
GCCAGAAAAA 
AAATTAAA7A 




TCTCTTCTTT 
TTAAAGACCC 
ACGAATTCAA 
CTCATGTTTA 
TATTACTTGT 
TTTATCTGGT 
XTTTTCTTTT 
ACTGTAACTT 
CCGAATATAT 
GGGGAAAATG 
AATAACTATT 
AACtGTTTXC 
attgcactag
aacttgccaa 

TTTGGCAAAA 
CAATGATGCC 
GCACTTAGTA 
CTTATCATTT 
ATTATCAGTT 
TTTTTCAGTT 
TATCGATTTA 
CATTTTCTTC 
TTGTTTTAAT 
AAGTCAAAAA 



GAATTCTTTO 
AATGTTCGCT 
CAAAATGTAA 
CAACAT7AGG 
CTTTTXACAA 
TATTACACAA 
AAAATCACAT 
CATATGCAGA 
GAAGGAAAGA 
ACAATTATCC 
CTACTTTTTC 
AAAA 



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 509 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Val Asp Gly Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu
1 5 10 15

Pro Ser Pro Ser Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu
20 25 30

Tyr Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys
35 40 45

Glu Gly Gln Gln Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His
50 55 60

Val Tyr Gln Lys Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr
65 70 75 80

Cys Lys Thr Pro Pro Ser Pro Gly Gln Ala Val Glu Cys Cys Gln Gly
85 90 95

Asp Trp Cys Asn Arg Asn Ile Thr Ala Gln Leu Pro Thr Lys Gly Lys
100 105 110

Ser Phe Pro Gly Thr Gln Asn Phe His Leu Glu Val Gly Leu Ile Ile
115 120 125


2070 

2130 

2190 

2250 

2310 

2370 

2430 

2490 

2550 

2610 

2670 

2724 
Leu Ser Val Val Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val
130 135 140

Ala Leu Arg Lys Phe Lys Arg Arg Asn Gln Glu Arg Leu Asn Pro Arg
145 150 155 160

Asp Val Glu Tyr Gly Thr Ile Glu Gly Leu Ile Thr Thr Asn Val Gly
165 170 175

Asp Ser Thr Leu Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser
180 185 190

Gly Ser Gly Leu Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile
195 200 205

Thr Leu Leu Glu Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg
210 215 220

Gly Ser Trp Gln Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg
225 230 235 240

Asp Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met
245 250 255

Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser
260 265 270

Arg His Ser Ser Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met
275 280 285

Gly Ser Leu Tyr Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser
290 295 300

Cys Leu Arg Ile Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His
305 310 315 320

Ile Glu Ile Phe Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp
325 330 335

Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile
340 345 350

Ala Asp Leu Gly Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu
355 360 365

Asp Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro
370 375 380

Glu Val Leu Asp Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys
385 390 395 400

Arg Val Asp Ile Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg
405 410 415

Arg Met Val Ser Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr
420 425 430

Asp Val Val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val
435 440 445

Cys Val Asp Gln Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp
450 455 460

Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln
465 470 475 480

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr
485 490 495

Lys Ile Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys
500 505



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2932 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 310..1905



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CCTCCGCGCC CAGGGCTCGA GGATGCGTTC CCTCGGGTCC GCACTTATGA AAATATCCAT 60
CAGTTTAATA CTCTCTTCCA ATTCATGACA TGGAAGCATA CGTCAAAGCT CTTTCGAGAA 120
AATCAGAAGT ACAGTTTTAT CTACCCACAT CTTGGACGAG TCGTAAGAAA GCAGTCCGAG 180
TTCAAGTCAT TGTCAAGTGC TTCCGATCTT TTACAAGAAA ATCTCACTCA ATGATAGTCA 240
TTTAAATTGC TCAAGTAGCA AGACCAATTA TTAAACGTGA CAGTACACAG GAAACATTAC 300
AATTCAACA ATG ACT CAG CTA TAC ATT TAC ATC ACA TTA TTC CCA CCC 348
Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala
1 5 10

TAT TTG TTC ATC ATT TCT CGT GTT CAA CGA CAG AAT CTC CAT ACT ATC 396
Tyr Leu Phe Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met
15 20 25




CTT CAT CGC ACT GGC ATC AAA TCA CAC TCC CAC CAG AAA AAG TCA GAA 444 
Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu
30 35 40 45

AAT GGA CTA ACC TTA CCA CCA CAG CAT ACC TTG CCT TTT TTA AAO TGC 492
Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys
50 55 60

TAT TCC TCA CGC CAC TOT CCA CAT CAT CCT ATT AAT AAC ACA TGC ATA 540
Tyr Cys Ser Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile
65 70 75

ACT AAT GGA CAT TGC TTT CCC ATC ATA CAA GAA CAT CAC CAG GGA GAA 588
Thr Asn Gly His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu
80 85 90

ACC ACA TTA CCT TCA CGC TGT ATC AAA TAT CAA GGA TCT CAT TTT CAC 636
Thr Thr Leu Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln
95 100 105

TGC AAA CAT TCT CCA AAA CCC CAG CTA CCC CGG ACA ATA CAA TGT TCT 684
Cys Lys Asp Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys
110 115 120 125

CGG ACC AAT TTA TCT AAC CAG TAT TTG CAA CCC ACA CTG CCC CCT GTT 732
Arg Thr Asn Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val
130 135 140

GTC ATA GGT CCG TTT TTT CAT GGC AGC ATT CCA TCC CTG CTT TTG CTC 780
Val Ile Gly Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu
145 150 155

ATT TCT ATG GCT CTC TGC ATA ATT CCT ATG ATC ATC TTC TCC AGC TCC 828
Ile Ser Met Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys
160 165 170

TTT TGT TAC AAA CAT TAT TGC AAC AGC ATC TCA AGC ACA CCT CGT TAC 876
Phe Cys Tyr Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr
175 180 185

AAT CGT CAT TTC GAA CAG CAT CAA CCA TTT ATT CCA GTT GGA GAA TCA 924
Asn Arg Asp Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser
190 195 200 205

CTA AAA GAC CTT ATT CAC CAG TCA CAA AGT TCT CGT ACT GGG TCT CCA 972
Leu Lys Asp Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly
210 215 220

CTA CCT TTA TTG CTT CAC CCA ACT ATT CCC AAA CAC ATT CAG ATG CTC 1020
Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val
225 230 235

CGG CAA CTT CGT AAA CGC CCA TAT CCA GAA CTA TGC ATG GGC AAA TCC 1068
Arg Gln Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp
240 245 250

CGT GGC CAA AAA GTC GCC CTG AAA CTA TTC TTT ACC ACT GAA GAA GCC 1116
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala
255 260 265 






2 s s 2 2 s 2 s 2 2 2 2 22 j- s »« 

280 285 

2 2 2 2 2 2 2 2 2 2 2 2 £ g 2 IS »» 



300 

2 2 2 2 2 2 2 2 2 »= - - - - - s 

310 315 

-222222222222222 

* w 325 330 * 

IS S 22 12 !?I 2! 5?- T 5» P C ^ CAC ACA CAA ATT 

345 



410 



CAC CAA ACC CTC AAC AAA AAC CAC TTC CAC CCC TAC ATC ATC err r»n 
A.p Clu Ser Leu A.n Ly. A.n Hi. , ta cln « Tyr x2 X S Sp 



475 



1308 



jn ^ i= K 2 Si S 2 2 2 2 2 S 2 £ "» 
jg 2 2 2 £?| 2 a S 2 B5 2 222 - 

360 355 

£ 2 2 2 2 2 2 2 £ 2 2 2 2 2 2 2 »» 

375 380 

2 2 2 2 2 2 2 2 2 S 2 2 2 2 55 2 «- 

390 395 
TTC AAT ACC AGO CTC CCC ACC AAA CCC TAC ATC CCT CCC ca» r-nr r~~ 

Leu Afln ISS ^ V41 cl * Th * is; g? X aS S ££ 55 2 1548 



1596 



1644 



2 2 2 2 2 2 2 S 2 K S 2 2 2 2 2 

S 440 445 

ACA CCA CCC ATC CTC CAA CAA TAC CAA TTC CCA TAT TAC A»r irr »*. 
Thr Cly Cly II. jjl Clu Clu Tyr Cln 2 S ^ g «} »«« 

455 460 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 



CCT TTC CGC CCA ATT CTC TCT AAT CCC TCC XXG &ri> -*» 

at, u~ j,, ,„ n. ¥>1 s . t j 222 2 2 2 2 2 

CCA CCA CTT TTC AAC CTA ATC TCA CAA TCC TCC CCC CAC AAT rr, ~~ 
Arg ji j val Leu Ly. Leu Met Ser Clu Cy. Jg K £2 £J S " 36 



505 






TCC ACA CTC ACA OCA TTO ACA ATT AAO AAO ACQ CTT OCC AAO ATO CTT 1884
Ser Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val
510 515 520 525

CAA TCC CAA CAT OTA AAA ATC TCATCCTTAA ACCATCOCAC CACAAACTCT 1935
Glu Ser Gln Asp Val Lys Ile
530

ACACTCCAAC AACTCTTTTT ACCCATOCCA TCOCTOCAAT TACACTCCAA TAACGATCTT 1995
AACTTCCTTC TCAOACTCTT TCTTCACTAC CTCTTCACAO OCTCCTAATA TTAAACCTTT 2055
CACTACTCTT ATTACGATAC AACCTGGGAA CTTCTAAACA CTTCATTCTT TATATATCCA 2115
CACCTTTATT TTAAATCTCC TTTTTCATCC CTTTTTTTAA CTCCCTTTTT ATCAACTCCA 2175
TCAACACTTC AATCCTCATT AGTCTCTCCA CTCAACCTCT CCGTACTGAA TTOCCTOTTC 2235
ATAAAACGGT CCTTTCTCTO AAAGCCTTAA CAACATAAAT CACCCCACCA GACATGGAGA 2295
AATACACTTT CCCTTTTACC TCAGACATTC AGTTCCTTTG TATTCTACCT TTOTAAAACA 2355
CCCTATAGAT CATGATGTGT TTGCGATACT GCTTATTTTA TCATACTTTC TCCTCTCTCC 2415
TTAGTCATGT CTCTGTCTCT CCATGCACAT GCACGCCGCC ATTCCTCTCC TCCCATTTCA 2475
ATTAGAACAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG TCGTTTTCTG 2535
CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTCCA 2595
AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 2655
AGAAGTTTAA AGCATCTCTA AATTTGGACT GTTTTCCTTC AACGACCATT TTTTTTCTCG 2715
TTATTATTTT TGTCACGCAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 2775
ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT OCATTTGATA GCAATGTAAG 2835
TCCCTATAAC CATGTTCTAT ATTCTTTATT CTCACTAACT TTTAAAAGGG AACTTATTTA 2895
TATTTTCTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 532 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala Tyr Leu Phe
1 5 10 15

Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly
20 25 30

Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu Asn Gly Val
35 40 45

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser
50 55 60

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly
65 70 75 80

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu
85 90 95

Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp
100 105 110

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn
115 120 125

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly
130 135 140

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu Ile Ser Met
145 150 155 160

Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr
165 170 175

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr Asn Arg Asp
180 185 190

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp
195 200 205

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu
210 215 220

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val
225 230 235 240

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu
245 250 255

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe
260 265 270

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile
275 280 285

Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln
290 295 300

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe
305 310 315 320

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr
325 330 335

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr
340 345 350

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile
355 360 365

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala
370 375 380

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro Leu Asn Thr
385 390 395 400

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser
405 410 415

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser
420 425 430

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly
435 440 445

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp
450 455 460

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg
465 470 475 480

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val
485 490 495

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu
500 505 510

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln
515 520 525

Asp Val Lys Ile
530



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2333 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1515

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATC CCG GAG TCG CCC GGA CCC TCC TCC TTC TTC CCC CTT CTT GTC CTC 48 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
1 5 10 15

CTC CTC CCC GGC AGC CGC CGC TCC CCC CCC CCC CCG CTC CAC GCT CTC 96
Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu
20 25 30

CTG TCT CCG TGC ACC AGC TCC CTC CAC CCC AAC TAC ACG TCT GAG ACA 144
Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr
35 40 45

CAT CGC CCC TGC ATC CTT TCC TTT TTC AAT CTC CAT CGG ATC GAG CAC 192
Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His
50 55 60

CAT GTC CGC ACC TGC ATC CCC AAA GTC GAG CTG GTC CCT GCC CGG AAC 240
His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

CCC TTC TAC TGC CTC ACC TCC CAC CAC CTG CCC AAC ACC CAC TCC TCC 288
Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
85 90 95

TAC ACT CAC TAC TCC AAC ACC ATC CAC TTC ACG CTC CCC ACT CCT CAC 336
Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110

CTC AAC GAG CCT CAC CAC CCC TCC ATC TCC CCC CCC CTC CAC CTC CTA 384
Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val
115 120 125

GCC ATC ATC CCC CCC CCC CTC TTC CTC CTC TTC CTC ATC ATC ATC ATT 432
Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

CTT TTC CTT CTC ATT AAC TAT CAT CAC CCT CTC TAT CAC AAC CCC CAG 480
Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

AGA CTG CAC ATC GAA CAT CCC TCA TCT CAG ATC TCT CTC TCC AAA CAC 528
Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

AAC ACC CTC CAC CAT CTT CTC TAC CAT CTC TCC ACC TCA CCG TCT CCC 576
Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

TCA CCC TTA CCC CTC TTT CTC CAC CGC ACA CTC GCC CCA ACC ATC CTT 624
Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

TTA CAA GAG ATT ATT CCC AAC CCT CCC TTT CCC GAA CTA TCC CCG CCC 672
Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220 



CCC TCC AGO OCT OCT CAT CTO OCT OTO AAA ATA TTC TCT TCT OCT CAA 720
Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

CAA CCC TCT TCC TTC AGO CAA OCA CAO ATA TAC CAC ACS CTC ATC CTO 768
Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
245 250 255

CCC CAT CAA AAC ATC CTT CCA TTT ATT OCT OCT CAC AAT AAA CAT AAT 816
Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

£2° 0X0 0X0 900 011 CTT CAC TAT CAT CAC CAC CCC 864
Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

TCC CTC TTT CAT TAT CTC AAC CCC TAC ACA CTC ACA ATT CAC CCC ATC 912
Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

ATT AAC CTC GCC TTC TCT GCT CCT ACT CCC CTO CCA CAC CTC CAC ATC 960
Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

GAG ATC CTC CCC ACC CAA CGO AAC CCT CCA ATT CCT CAT CCA CAC TTA 1008
Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

AAC TCA AAC AAC ATT CTC CTC AAC AAA AAT GCC ATC TCT CCC ATA CCA 1056
Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

CAC CTC CCC CTC CCT CTC CCT CAT CAT CCA CTC ACT CAC ACC ATT CAC 1104
Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

ATT CCC CCC AAT CAC ACC CTC GCC ACC AAA CCA TAC ATC CCC CCT CAA 1152
Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

CTA CTT CAT CAA ACC ATT AAT ATC AAA CAC TTT CAC TCC TTT AAA TCT 1200
Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

CCT CAT ATT TAT CCC CTC GGG CTT CTA TAT TCC GAG ATT CCT CCA ACA 1248
Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

TCC AAT TCT GCA CCA CTC CAT CAA CAA TAT CAC CTC CCA TAT TAC CAC 1296
Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

TTA CTC CCC TCT GAC CCT TCC ATT GAG CAA ATC CCA AAC CTT CTA TCT 1344
Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

CAT CAC AAC CTC CCT CCC AAC ATC CCC AAC TCC TCC CAC ACT TAT CAC 1392
Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460




w 47 * 480 

WC C« CCC CCC CTC AOO CCC CIO C6C AM AAO *»B irr «y *v. 

Cly Ale AU Arg jj« Thr AU X-»2JSgI5!Sf2SS! 

* Ba 490 495 

CTC ACC CTC CAO 6AA GAC CTC AAC ATC TAACTOCTee rTrrjn T ^i.i 
Leu Ser Val Ola Clu A.p v*l Lye J5 IAACTCCTCC «CTCTCCAC 153s 
500 $05 

ACGCAGCTCC IGCCAOCCAO AACXACCCAC ACCIOCCCCC TTGAGCCTAC GAXOCAGCCC 
TACCTCTCGI TTCTCCCCAO CCCTCTCTCO CCAGCA6CCC TGCCCCGCAA CAOCGACACA 
CCCCCCCAGA CACTCCCTCA CTCCCATCTT CCCTTTCACA CAGACACCIT TTCTATTTAC 
CTCCTAATOC CATCCACACT CICAGAGCCA ATTCTCTCCA CAACTCACTO CCACACCTCC 
AACTGCXIOT AGTCCGAAGT CCCCCCAAAC CCGOTCCATC TCCCACOTCO CCAGCAGCCA 
TCACAGGGCC GCTTGGGAGG GCCCGCAGCA ACCCAGCIGI TGCCAGICCI AACCTOCCCT 

CAcocrrrcc ticgcggacc accccacaoc acaccaaggt gccccggaag aaccagaact 

CCACCCCCTC TCACAGGCAO CTCTCACCCC CCCTTTCCCC TCCTCCCTGG CATCCACCCT 
GCCGOGAGAC IGCCACTGOA CACCOAATCT CCCGCTTTGT CTGTCCAOCC CIOTGI6CAT 
GTOCCCAGGI GCCTCCCCCG TTGTGCCTOG TTCCTCCCAT CCCCTTACAC GTGCCTGTCA 
GTGTGTGTCT CTCICIGIAG CTGOGCACTT ACCTCCTTCA CCTTTCTCTO CATCTCCACG 
TCGGCGGTGT GGTCGTCATG CTCTCCGTGC TTCCTGCTCC CTCTTTTCAG TACTCACCAC 
CATCTAGTTT CCCTOCTGCC CTTCCCTOGA GGICTCTCCC TCCCCCAGAG CCCCTCA1CC 
CACAGTOGTA CTCTGTGT 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 505 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Aim Glu Ser Ale Cly Al» Ser S.r Ph. Ph. Pro Leu Vai V.i 

Leu Leu AU Gly Ser Cly cly Ser Gly Pro Arg Gly Val cm AU Leu 

25 30 



1595 

1655 

1715 

1775 

1835 

1895 

1955 

2015 

2075 

2135 

219S 

2255 

2315 

2333 
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Leu cy« Alt Cys Thr Ser Cys Leu oin Ala Asn Tyr Thr Cys Olu Thr 
35 40 45 

Asp Cly Ala Cys Mat Val Sar Ph. Ph. Asn Lsu Asp Cly Mat Clu Bis 

55 £Q 

Bit Val Arg Thr cys II. Pro Lys Val clu Leu Val Pro Ala Cly Lys 

65 70 7$ 80 

Pro Ph. tyr Cys Leu s.r S.r Clu Asp Lau Arg Asn Thr Bis Cys Cys 

85 90 95 ' 

Tyr Thr Asp Tyr Cys Asn Arg lis Asp Leu Arg Val Pro Ssr Cly Bis 

100 105 no 

Leu Ly. Clu Pro Clu Bis Pro S.r Mat Trp Cly Pro Val Clu Lau Val 

AiS 120 125 

Cly II. 21a Ala Cly Pro Val Ph. La U Lau Phs Lau lis Ila Ila II. 

"° 135 140 

Val Phe Leu Val 11. Asn Tyr Hi. Gin Arg Val Tyr Hi. Asn Arg Cln 

" S 150 155 160 

Arg Leu Asp Met Clu Asp Pro Ser Cys Clu Met Cys Lau Ser Lys Asp 

165 170 175 * 

Ly. Thr Leu Cln Asp Leu Val Tyr Asp Leu Ser Thr Sar Cly Sar Cly 

180 185 190 

Ser Cly Leu Pro Leu Phe Val Cln Arg Thr Val Ala Arg Thr lla Val 

195 200 205 

Leu Cln Clu Ila Ila Cly Lys Cly Arg Phe Cly Clu Val Trp Arg Cly 

* i0 215 220 

Arg Trp Arg Cly Cly Asp Val Ala Val Lys lie Phe Ser Sar Arg Clu 

225 230 235 240 

Clu Arg Ser Trp Phe Arg Clu Ala Clu He Tyr Cln Thr Val Met Leu 
2 * 5 250 255 

Arg His Clu Asn He Leu Cly Phe II. Ala Ala Asp Asn Lys Asp Asn 

260 265 270 

Cly Thr Trp Thr Cln Leu Trp Leu Val Ser Asp Tyr Hi. Clu Hi. cly 

275 280 285 

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr He Clu Cly Met 
290 295 300 

lie Ly. Leu Ala Leu Ser Ala Ala Ser Cly Leu Ala His Lau Bis Mat 

310 315 320 

Clu Ila Val Cly Thr Cln Cly Lys Pro Cly He Ala Bis Arg Asp Lau 

325 330 335 

Lys Ser Lys Asn Ila Leu Val Lys Lys Asn Cly Mat Cys Ala Ila Al. 

3*0 345 * 350 *■ **» 



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A.p f u Cly L.„ Ala V.l Ar, Hi. A.p Al. V.l Thr A.p Thr XI. A.p 

II. Ala Pro A.b Cln Ar, V.l Cly Thr Ly. Arg Tyr let AU Pro Clu 

u 375 38O 

Vjl L.u A.p Clu Thr II. A.n K.t Ly. Hi. Ph. A.p S.r Ph. Ly. Cy. 

395 400 

Al. A.p 21. Tyr Ala U« Cly L.u V.l Tyr Trp Clu II. Ala Arg Arg 
405 410 4il * 

Cy. Aan S.r Cly Cly v.l Hi. clu clu Tyr Cln L.u Pro Tyr Tyr A.p 

425 430 

L.u V.l Pro S.r A.p Pro S.r II. oiu Clu K.t Arg Ly. v.l Val Cy. 

440 445 

A.p Cln Ly. L.u Arg Pro A.n II. Pro A.n Trp Trp Cln S.r Tyr Clu 

455 450 

Jlj Leu Arg Val Hat Cly Ly. Met Mat Arg Clu Cy. Trp Tyr Ala A.n 

u 475 480 

Cly Ala Ala Arg Leu Thr Ala Leu Arg He Ly. Ly. Thr Leu S.r Cln 
4B5 490 495 

Leu ser Val Cln Clu A.p Val Ly. He 

500 sos 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2308 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mouse

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 77..1585

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CCCCAGCCGA CGTTTCCTCC CCTCAGCCAC CCCCCCCCCC CCCCCCCCCC CCCCCACACC 60 
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CCGTGOCCCC CCCACC AT6 CAO CCC COO CIC CCI CCT COO OCT CCC CCO 
M.t Clu Ala Al. V.l Al* Al. Pro Arg Pro Arg 
* 5 10 

CTG CTC CTC CTC CTC CTG COG COC CCO CCC CCC CCO CCC CCC COG CTG 
L6U L9U Vftl L « U Al. Al. Al. Al. Al. iS Si Su 

15 2 0 25 






109 



157 



205 



SS JS 5?° P* « C «T TTC TCC CAC CTC TCT ACA AAA 
L«u Pro Cly Al. Thr Al. L.u cin Cy. Ph. Cy. Hi. l» u Cy. Thr Ly. 

30 35 40 

CAC AAT TTT ACT TCT CTC ACA CAT CCC CTC TCC TTT CTC TCT CTC ACA 3*n 
A.p A.n Ph. Thr Cy. V.l Thr A.p Cly L.u Phi SI? IS 51? *" 



? AC 5?° £5* ? XC *** CIT ATA 0X0 AAC ACC ATC TCT ATA CCT CAA ATT 
Clu Thr Thr A.p Ly. V.l XI. Hi. A.n S.r M.t Cyl S A?I ell S 

60 *5 70 75 

CAC TTA ATT CCT CCA CAT ACC CCC TTT CTA TCT CCA CCC TCT TCA AAA 
Asp Leu lie Pro Arg A.p Arg Pro Ph. V.l gl S £1 IS IS iJJ 
80 85 go 

iS G^v IS 51? tS f£* I AT TCC AAT CAC CAC CAT TCC AAT 

Thr Cly ser V.l Thr Thr Thr Tyr Cy. Cy. A.n cin A.p Hi. Cy. A.n 

ss 100 105 

AAA ATA CAA CTT CCA ACT ACT CTA AAC TCA TCA CCT CCC CTT CCT CCT 
Ly. lie Clu Leu Pro Thr Thr V.l Ly. Ser sS p£ SS SI ctj £1 
ii0 115 12 o 

CTC CAA CTC CCA CCT CTC ATT CCT CCA CCA CTC TCC TT etc far i+r> 
V.1 Clu Leu Al. Al. V.1 21. Al. Cly So* 51? £1 HI SI? S 

130 135 

TCA CTC ATC TTC ATC CTC TAT ATC TCC CAC AAC CCC ACT CTC ATT rxr 
ser Leu Met Leu Met y.l Tyr II. Cy. Hi. A.n Sf JS SI? J" SI 
140 145 150 155 

CAT CCA CTC CCA AAT CAA CAC CAC CCT TCA TTA CAT CCC CCT m m 
Hi. Arg V.l Pro A.n Clu Clu A.p Pro S.r £J SJJ Arg Pro Phi 2 ll 
160 "5 170 

TCA CAC CCT ACT ACC TTC AAA CAC TTA ATT TAT CAT ATC ACA A«s *<-.m 
Ser Clu Cly Thr Thr Leu Ly. A.p Leu iS ?! JS Jfi Jg Jg 
175 180 185 

CCT TCT CCC TCA CCT TTA CCA TTC CTT CTT CAC ACA ACA ATT CCC aci 
Cly ser Cly S.r Cly Leu Pro Leu Leu V.l cin IrJ JS S £. Eg* 

195 200 

ACT ATT CTC TTA CAA CAA ACC ATT CCC AAA CCT OCA TTT CCA en 
Thr 21. V.1 Leu Cin Clu S.r 11. Cly Ly. ClJ SJ JJI ctj cfi SIT 

2X5 

TCC ACA CCA AAC TCC CCC CCA CAA CAA CTT CCT CTT AAC ATA TTC Tee ,o, 
Trp Arg Cly Ly. Trp Arg Cly Clu Clu V.l Al. V.l £. l2 "I IS 7 " 

225 230 235 



301 



349 



397 



445 



493 



541 



589 



637 



€85 



733 
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TCT AGA CAA CAA OCT TCC TOO TTC OCT CAC CCA CAC ATT TXT en* arrr 
S.r Arg Clu Clu Arg S.r Trp Ph. Arg Clu AU f" J£ gj iS 

CTA ATC TTA CCT CAT CAA AAC ATC CTC CCA TTT ATA CCA crx nxr »<p 

vi M. t l. u at. Hi. ciu A. n :i. SI Jli 2S EI 

260 jgs 

AAA CAC AAT COT ACT TOO ACT CAC CTC TCC TTO CTC TCA CAT TAT CAT 
Ly. A«p A.n Cly Thr Trp Thr Cln Uu Trp L.u vil 12 2J ™ SI 

270 275 ago 

SI IS 21 IS £ c P A f* 0 A0X TAC ACA 6TT ACT CTC 

3« y * ha A !S **" A,B **9 Tyr Thr Val Thr Val 

290 295 

GAA COA ATC ATA AAA CTT CCT CTC TCC ACC COS ACC OCT CTT r.rr n«i> 
Clu Cly M.t II. Ly. L.u Al. L.u s.r £ SS 5« gj 21 EI SI 

Jua 310 

CTT CAC ATC CAC ATT CTT CCT ACC CAA CCA AAC CCA err *<r-f 
Leu Hi. M.t Clu 11. Val cly thr £y ™ J£ S JS S SI 
320 325 330 

AGA GAT TTG AAA TCA AAG AAT ATC TTC CTA AAC AAC xxr nrx *n*> 
Arg Asp Leu Ly. Ser Ly. A.n 11. £u vll f£ i£ £1 Si? 52 £1 
JJS 340 345 

TGT ATT CCA CAC TTA CCA CTC CCA CTA ACA CAT CAT TCA CCO AC* cm- 
Cy. 11. Al. A. P Leu Cly L.u Al. Val Arg SI IS SI JS SJ 

Jao 355 

ACC ATT CAT ATT CCT CCA AAC CAC ACA CTC CCA ACA AAA am Tin %^ 
Thr II. A. P U. Al. Pro A.n Hi. ArJ 51? Thr J£ ™ 51? 

?ff » CT S** 0X0 6AT 6AT TCC *™ AAT ATC AAA CAT TTT CAA TCC 

Al. Pro Glu V.l L.u A.p A.p S.r II. A.n M.t Ly. Hi! HI 85 I« 

380 385 390 395 

TTC AAA CCT CCT CAC ATC TAT CCA ATC GGC TTA CTA TTC TCC exx xr* 
Phe Ly. Arg Al. A.p II. Tyr Al. Het Cly 2i Si HI SJ iS 
400 405 * 410 



829 



877 



925 



973 



1021 



1069 



1117 



1165 



1213 



1261 



1309 



CCT CCA CCA TCT TCC ATT CCT CGA ATT CAT CAA CAT TAC CAA CTC CCT 
Ala Arg Arg Cy. S.r II. Cly Cly II. Hi. Clu A.p Tyr cfn Su SI 
4i5 420 425 

TAT TAT CAT CTT CTA CCT TCT CAC CCA TCA CTT CAA CAA Are xrx in 
Tyr Tyr A.p L«u Val Pro S.r A.p Pro S.r 5" cJJ cJJ J2 £J J£ 
* ia 435 440 

CTT CTT TCT CAA CAC AAC TTA ACC CCA AAT ATC CCA AAC ACA TCC exe 
Val Val Cy. Clu Cln Ly. Leu Ar, Pro A.n II. Pro JlS JSJ Tr? SS 

ACC TCT CAA CCC TTC ACA CTA ATC CCT AAA ATT ATC ACA CAA TCT TCC ien, 
Ser Cy. Clu Al. L.u Arg V.l H.t Ala Ly. II. m2 Zg 85 "I 1501 

465 470 



1357 



1405 



1453 
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TAT CCC AAT CCA CCA OCT ACC CTT ACA OCA TTC CGC ATT AAO AAA ACA 1549 
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
480 485 490

TTA TCC CAA CTC ACT CAA CAG GAA CGC ATC AAA ATG TAATTCTACA 1595
Leu Ser Gln Leu Ser Gln Gln Glu Gly Ile Lys Met
495 500

CCTTTCCCTG AACTCTCCTT TTTTCTTCAC ATCTGCTCCT GGGTTTTAAT TTGGGAGGTC 1655
AGTTGTTCTA CCTCACTGAG AGGGAACAGA AGGATATTGC TTCCTTT7GC AGCACTGTAA 1715
TAAACTCAAT TAAAAACTTC CCACCATTTC TTTCCACCCA GGAAACACCC ATCTGGGTCC 1775
TTTCTGTGCA CTATGAACGC TTCTTTCCCA GGACAGAAAA TGTGTACTCT ACCTTTATTT 1835
TTTATTAACA AAACTTGTTT TTTAAAAAGA TCATTCCTCC TCTTAACTTT AGGTAACTCT 1895
CCTCTCCTCC ACATCATCTT TAAGGGCAAA GCAGTTGCAT TGCTCAATTA CAATCAAACA 1955
TCTCTTATTA CTAAAGAAAC TCATTTACTC CTCCTTACTA CATTCTCACA GGATTCTCAA 2015
CCACTAGACT TTCCTTCATT cACAcrrrcA ATCTACTCTT CTATACTTTT TCACCATCTT 2075
AAAACTAACA CTTATAAAAC TCTTATCTTG ACTCTAAAAA TCACCTCATA TACTAGTCAG 2135
CAACATAATT CATCCAATTC TATTTTGTAT ACTATTATTC TTCTTTCACT TATTCAGAAC 2195
ATTACATCCC TTCAAAATCC CATTCTACTA TACCACTAAG TCCCACTTCT CTCTCTTTCT 2255
AATCCAAATC ACTACAATTG CTCAAAGTCT CTATGTTAAA ACCTATACTC TTT 2308



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 503 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg Leu Leu Leu Leu Val
1 5 10 15

Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu Leu Pro Gly Ala Thr
20 25 30

Ala Leu Gln Cys Phe Cys His Leu Cys Thr Lys Asp Asn Phe Thr Cys
35 40 45

Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr Glu Thr Thr Asp Lys
50 55 60

Val Ile His Asn Ser Met Cys Ile Ala Glu Ile Asp Leu Ile Pro Arg
65 70 75 80

Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys Thr Gly Ser Val Thr
85 90 95

Thr Thr Tyr Cys Cys Asn Gln Asp His Cys Asn Lys Ile Glu Leu Pro
100 105 110

Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro Val Glu Leu Ala Ala
115 120 125

Val Ile Ala Gly Pro Val Cys Phe Val Cys Ile Ser Leu Met Leu Met
130 135 140

Val Tyr Ile Cys His Asn Arg Thr Val Ile His His Arg Val Pro Asn
145 150 155 160

Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe Ile Ser Glu Gly Thr Thr
165 170 175

Leu Lys Asp Leu Ile Tyr Asp Met Thr Thr Ser Gly Ser Gly Ser Gly
180 185 190

Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Arg Thr Ile Val Leu Gln
195 200 205

Glu Ser Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly Lys Trp
210 215 220

Arg Gly Glu Glu Val Ala Val Lys Ile Phe Ser Ser Arg Glu Glu Arg
225 230 235 240

Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu Arg His
245 250 255

Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn Gly Thr
260 265 270

Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly Ser Leu
275 280 285

Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val Glu Gly Met Ile Lys
290 295 300

Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His Leu His Met Glu Ile
305 310 315 320

Val Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser
325 330 335

Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu
340 345 350

Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp Thr Ile Asp Ile Ala
355 360 365

Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu
370 375 380

Asp Asp Ser Ile Asn Met Lys His Phe Glu Ser Phe Lys Arg Ala Asp
385 390 395 400

Ile Tyr Ala Met Gly Leu Val Phe Trp Glu Ile Ala Arg Arg Cys Ser
405 410 415

Ile Gly Gly Ile His Glu Asp Tyr Gln Leu Pro Tyr Tyr Asp Leu Val
420 425 430

Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys Val Val Cys Glu Gln
435 440 445

Lys Leu Arg Pro Asn Ile Pro Asn Arg Trp Gln Ser Cys Glu Ala Leu
450 455 460

Arg Val Met Ala Lys Ile Met Arg Glu Cys Trp Tyr Ala Asn Gly Ala
465 470 475 480

Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln Leu Ser
485 490 495

Gln Gln Glu Gly Ile Lys Met
500

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1922 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mouse

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 241..1746

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAGAGCACAG CCCTTCCCAG TCCCCGCAGC CCCCCCCCGA CGCGCGCATG ATCAAGACCT 60

TTTCCCCGGC CCCACAGGGC CTCTGGACGT GAGACCCCGG CCGCCTCCGC AAGGAGAGGC 120 

GGGGGTCGAG TCGCCCTGTC CAAAGGCCTC AATCTAAACA ATCTTGATTC CTGTTGCCGG 180 

CTGGCGGGAC CCTCAATGGC AGGAAATCTC ACCACATCTC TTCTCCTATC TCCAAGGACC 240 

ATG ACC TTG GGG AGC TTC AGA AGG GGC CTT TTG ATG CTG TCG GTG GCC 288 
Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala
1 5 10 15
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CTC CAG AGC ACC CTA OCT CGC CAC CTT CCC CTC CTA CAG TCT CTC CCA 
Val Gin Arg Thr Val Ala Arg Gin Val Ala Leu Val Clu Cye Val Civ 
195 200 205 

AAG GGC CGA TAT GGC GAG GTG TGC CCC CCT TCC TCC CAT CCC CAA ACC 

Lya Gly Arg Tyr cly Clu Val Trp Arg Cly Sar Trp His Cly clu Sar 
210 215 220 

CTC CCC CTC AAC ATT TTC TCC TCA CCA CAT CAC CAC TCC TCC TTC CGC 
Val Ala Val Lya XI* Phe Sar Sar Arg Asp Clu Gin Sar Trp Phe Aro 
22S 230 235 240 

GAG ACG CAC ATC TAC AAC ACA CTT CTC CTT AGA CAC CAC AAC ATC CTA 
Glu Thr Glu Zle Tyr Am Thr Val Lau Leu Arg Hit Asp Asn lie Leu 
24S 250 255 



4 BO 




TTC CCC CTA ACC CAO CCC AGA CTT CCC AAO CCT TCC AAC CTC CTC AAC 
Lau Cly Leu Thr Gin Cly Arg Leu Ala Lye Pro Sar Lya Leu Val Aan 
20 25 30 

TCC ACT TCT CAC ACC CCA CAC TCC AAC ACA CCA TTC TCC CAC CGC TCA 3fli 
Cya Thr Cye Clu Sar Pro Hia Cya Lya Arg Pro Phe Cya Cln Cly Ser 
35 40 45 

1°° I?° £5* S 1 ? CTC 070 0TT «*« CAG GGC AGC CAC CCC CAC CTC 432 

Trp Cya Thr Val Val Leu Val Arg Clu Cln Cly Arg Hie Pro Cln Val 
50 55 60 

TAT CGC GGC TCT CGC AGC CTC AAC CAC CAC CTC TCC TTC CCA CCT CCC 
Tyr Arg cly Cya Cly Sar Leu Aan Cln Clu Lau Cya Leu cly Arg Pro 
65 70 75 eo 

ACG CAG TTT CTC AAC CAT CAC TCC TGC TAT ACA TCC TTC TCC AAC CAC 528 
Thr Clu Phe Leu Aan Hia Hio Cya Cya Tyr Arg Ser Phe Cv« Aan Hi? 

85 90 * 95 

AAC CTC TCT CTC ATC CTC CAG CCC ACC CAA ACT CCT TCC CAC CAC CCA 
Aan Val Ser Leu Met Lau Clu Ala Thr Gin Thr Pro Ser Clu Clu Pro 
100 105 no 

GAA CTT GAT CCC CAT CTG CCT CTC ATC CTC CCT CCT CTC CTC CCC TTC 
Glu Val Asp Ala Hia Leu Pro Leu lie Leu Cly Pro Val Leu Ala Leu 
115 120 125 

CCG CTC CTG GTG CCC CTG CCT GCT CTG CGC TTC TGC CCT CTC CGC CCC S73 
Pro Val Leu Val Ala Leu Gly Ala Leu Cly Leu Trp Arg Val Arg Aro 
130 135 240 

AGC CAG CAG AAC CAG CCC CAT TTC CAC ACT CAC CTG CCC CAC TCC ACT 
Arg Cln Glu Lya Gin Arg Aap Leu Hia Ser Aap Leu Cly Clu Ser Ser 
145 150 iss 16Q 

CTC ATC CTC AAC CCA TCT CAA CAC CCA CAC AGC ATC TTC CCC CAC TTC 
Leu He Leu Lya Ala Ser Clu Cln Ala Aap Ser Met Leu Cly Asp Phe 
165 170 * 175 

CTG CAC ACC CAC TCT ACC ACC CCC AGC CGC TCC CGC CTC CCC TTC TTC filfi 
Leu Asp ser Asp Cys Thr Thr Cly Ser Cly Ser Cly Leu Pro Phe Leu 
180 185 190 



576 



624 



720 



768 



664 



'912 



960 



1008 
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ccc rrc atc ccc tcc cac ato act tco cgg aac toq acc acc cao ctc lose 

Cly Pha Zla Ala Ser Alp Mat Thr Ser Arg Aan Sar Ser Thr Olo Leu 

260 265 270 

TCC CTC ATC ACC CAC TAC CAT GAA CAC CCC TCC CTC TAT CAC TTT CTC 1104 
Trp Leu Zla Thr His Tyr Bit Clu His Cly Sar Leu Tyr Aap Pha Leu 
275 280 285 

CAG ACC CAC ACC CTC CAC CCC CAC TTC CCC CTC ACC CTA OCT CTC TCC 1152 
Gin Arg Gin Thr Lau Clu Pro Gin Lau Ala Lau Arg Lau Ala Val Sar 
290 295 300 

CCC GCC TCC GGC CTC CCC CAC CTA CAT CTC CAC ATC TTT CCC ACT CAA 1200 
Pro Ala Cya Gly Lau Ala fiia Lau Hia Val Glu 21a Pha Gly Thr Gin 
305 310 315 320 

CCC AAA CCA GCC ATT CCC CAT CCT CAC CTC AAG ACT CCC AAT CTC CTC 1248 
Gly Lys Pro Ala Ila Ala Hia Arg Aap Lau Lya Sar Arg Aan Val Leu 
325 330 335 

CTC AAC ACT AAC TTC CAC TCT TCC ATT CCA CAC CTC CGA CTC CCT CTC 1296 
Val Lys Sar Asn Lau Gin Cya Cys Zla Ala Asp Lau Gly Lau Ala Val 
340 345 350 

ATC CAC TCA CAA ACC AAC CAC TAC CTC CAT ATC CCC AAC ACA CCC CCA 1344 
Met His Ser Gin Ser Asn Clu Tyr Lau Aap Zla Gly Asn Thr Pro Arg 
355 360 365 

CTC CGT ACC AAA ACA TAC ATC CCA CCC CAC CTC CTC CAT CAG CAC ATC 1392 
Val Gly Thr Lys Arg Tyr Mat Ala Pro Clu Val Lau Asp Glu His Zla 
370 375 380 

CCC ACA CAC TCC TTT CAG TCC TAC AAG TCG ACA CAC ATC TCC GCC TTT 1440 
Arg Thr Asp Cys Pha Glu Sar Tyr Lya Trp Thr Aap Zla Trp Ala Pha 
385 390 395 400 

CCC CTA CTC CTA TCC CAG ATC CCC CCC CCC ACC ATC ATC AAT CCC ATT 1488 
Gly Leu Val Lau Trp Glu Zla Ala Arg Arg Thr Zlo Zla Asn Gly Zla 
405 410 415 

CTC CAG GAT TAC ACG CCA CCT TTC TAT CAC ATC GTA CCC AAT CAC CCC 1536 
Val Glu Asp Tyr Arg Pro Pro Pha Tyr Aap Mat Val Pro Asn Asp Pro 
420 425 430 

AGT TTT GAG GAC ATC AAA AAG CTC CTC TCC CTT GAC CAG CAG ACA CCC 1564 
Ser Pha Clu Asp Mat Lya Lya Val Val Cya Val Asp Gin Gin Thr Pro 
435 440 445 

ACC ATC CCT AAC CCC CTC CCT CCA GAT CCC CTC CTC TCC GCC CTC GCC 1632 
Thr Zla Pro Asn Arg Lau Ala Ala Asp Pro Val Lau Sar Gly Lau Ala 
450 455 460 

CAG ATC ATG AGA GAG TCC TCG TAC CCC AAC CCC TCT OCT CCC CTC ACC 1680 
Gin Met Met Arg Glu Cys Trp Tyr Pro Aan Pro Sar Ala Arg Lau Thr 
465 470 475 480 

CCA CTC CCC ATA AAG AAG ACA TTC CAG AAG CTC AGT CAC AAT CCA CAG 1728 
Ala Leu Arg Zla Lys Lvs Thr Lau Gin Lya Lau Sar Hia Asn Pro Glu 
485 490 495 
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AAO CCC AAA CTO ATT CAC TAGCCCAGGG CCACCACCCT TCCTCTCCCT 1776 
Lys Pro Lys Val Ile His
500

AAACTGTGTC CTGGGCAAGA ACACATACCC TGTCTGGGTA GAGGGAGTGA ACACACTCTC 1836
CACGCTGCCC TCTCTCTCCC TCCTCACCTT GCTCCCAGCC CATCCACCCA AAAATACAGC 1896
TGAGCTCAAA TTCAAAAAAA AAAAAA 1922

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 502 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala
1 5 10 15

Leu Gly Leu Thr Gln Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Asn
20 25 30

Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gln Gly Ser
35 40 45

Trp Cys Thr Val Val Leu Val Arg Glu Gln Gly Arg His Pro Gln Val
50 55 60

Tyr Arg Gly Cys Gly Ser Leu Asn Gln Glu Leu Cys Leu Gly Arg Pro
65 70 75 80

Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His
85 90 95

Asn Val Ser Leu Met Leu Glu Ala Thr Gln Thr Pro Ser Glu Glu Pro
100 105 110

Glu Val Asp Ala His Leu Pro Leu Ile Leu Gly Pro Val Leu Ala Leu
115 120 125

Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg
130 135 140

Arg Gln Glu Lys Gln Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser
145 150 155 160

Leu Ile Leu Lys Ala Ser Glu Gln Ala Asp Ser Met Leu Gly Asp Phe
165 170 175

Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu
180 185 190

Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly
195 200 205

Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser
210 215 220

Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg
225 230 235 240

Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu
245 250 255

Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu
260 265 270

Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu
275 280 285

Gln Arg Gln Thr Leu Glu Pro Gln Leu Ala Leu Arg Leu Ala Val Ser
290 295 300

Pro Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln
305 310 315 320

Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu
325 330 335

Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val
340 345 350

Met His Ser Gln Ser Asn Glu Tyr Leu Asp Ile Gly Asn Thr Pro Arg
355 360 365

Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His Ile
370 375 380

Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe
385 390 395 400

Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Ile Asn Gly Ile
405 410 415

Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Asn Asp Pro
420 425 430

Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro
435 440 445

Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala
450 455 460

Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr
465 470 475 480

Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Leu Ser His Asn Pro Glu
485 490 495

Lys Pro Lys Val Ile His
500
(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2070 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mouse

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 217..1812

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATTCATCAGA TGCAAGCATA CGTCAAAGCT GTTCGGAGAA ATTGCAACTA CAGTTTTATC 60
TAGCCACATC TCTCACAATT CTGAAGAAAC CACCACCTCA AAGTCATTCC CAAGTCATTT 120
TGTTCTCTAA GCAAGCCTCC CTCATTCACT TACACCAGTG AGACACCAGC ACCAOTCATT 180

CAAAGCCCCG TGTACAGGAC GCGTGGCAAT CAGACA ATO ACT CAG CTA TAC ACT 234
Met Thr Gln Leu Tyr Thr
1 5

TAC ATC AGA TTA CTG CCA GCC TCT CTG TTC ATC ATT TCT CAT GTT CAA 282
Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe Ile Ile Ser His Val Gln
10 15 20

GGG CAG AAT CTA GAT ACT ATC CTC CAT GCC ACT GOT ATC AAA TCA CAC 330
Gly Gln Asn Leu Asp Ser Met Leu His Gly Thr Gly Met Lys Ser Asp
25 30 35

TTC GAC CAC AAC AAG CCA GAA AAT CCA CTG ACT TTA CCA CCA CAC CAT 378
Leu Asp Gln Lys Lys Pro Glu Asn Gly Val Thr Leu Ala Pro Glu Asp
40 45 50

ACC TTC CCT TTC TTA AAG TGC TAT TCC TCA CCA CAC TCC CCA CAT CAT 426
Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser Gly His Cys Pro Asp Asp
55 60 65 70

CCT ATT AAT AAC ACA TCC ATA ACT AAT GCC CAT TCC TTT CCC ATT ATA 474
Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly His Cys Phe Ala Ile Ile
75 80 85

GAA CAA CAT CAT CAG CCA CAA ACC ACA TTA ACT TCT GCG TCT ATC AAC 522
Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu Thr Ser Gly Cys Met Lys
90 95 100
TAT CAA CCC TCT GAT TTT CAA TCC AAG GAT TCA CCC AAA CCC CAO CTA 570
Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp Ser Pro Lys Ala Gln Leu
105 110 115

CCC ACC ACA ATA CAA TCT TCT CCC ACC AAT TTC TCC AAC CAC TAT TTC 618
Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn Leu Cys Asn Gln Tyr Leu
120 125 130

CAC CCT ACA CTC CCC CCT CTT CTT ATA GCT CCC TTC TTT CAT CCC ACC 666
Gln Pro Thr Leu Pro Pro Val Val Ile Gly Pro Phe Phe Asp Gly Ser
135 140 145 150

ATC CGA TGC CTC CTT CTC CTC ATT TCC ATC CCT CTC TGT ATA GTT CCT 714
Ile Arg Trp Leu Val Val Leu Ile Ser Met Ala Val Cys Ile Val Ala
155 160 165

ATC ATC ATC TTC TCC ACC TCC TTT TCC TAT AAC CAT TAT TCT AAG ACT 762
Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr Lys His Tyr Cys Lys Ser
170 175 180

ATC TCA ACC ACC CCT CCT TAC AAC CCT CAT TTC CAA CAC CAT CAA CCA 810
Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp Leu Glu Gln Asp Glu Ala
185 190 195

TTT ATT CCA CTA CCA CAA TCA TTC AAA CAC CTC ATT CAC CAC TCC CAA 858
Phe Ile Pro Val Gly Glu Ser Leu Lys Asp Leu Ile Asp Gln Ser Gln
200 205 210

AGC TCT CCC ACT CCA TCT CCA TTC CCT TTA TTC CTT CAC CCA ACT ATT 906
Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile
215 220 225 230

CCC AAA CAC ATT CAC ATC CTT CCC CAC CTT CCT AAA CCC CCC TAT CCA 954
Ala Lys Gln Ile Gln Met Val Arg Gln Val Gly Lys Gly Arg Tyr Gly
235 240 245

CAA CTA TGC ATC CCT AAA TCC CCT CCT CAA AAA CTC CCT CTC AAA CTC 1002
Glu Val Trp Met Gly Lys Trp Arg Gly Glu Lys Val Ala Val Lys Val
250 255 260

TTT TTT ACC ACT CAA CAA CCT ACC TCC TTT ACA CAA ACA CAA ATC TAC 1050
Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr
265 270 275

CAC ACC CTC TTA ATC CCT CAT CAA AAT ATA CTT CCT TTT ATA CCT CCA 1098
Gln Thr Val Leu Met Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala
280 285 290

CAC ATT AAA CCC ACT CCT TCC TCC ACT CAC CTC TAT TTC ATT ACT CAT 1146
Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp
295 300 305 310

TAC CAT CAA AAT CCA TCT CTC TAT CAC TTC CTC AAA TCT CCC ACA CTA 1194
Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe Leu Lys Cys Ala Thr Leu
315 320 325

CAC ACC ACA CCC CTA CTC AAC TTA CCT TAT TCT GCT GCT TCT CCT CTC 1242
Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr Ser Ala Ala Cys Gly Leu
330 335 340
TCC CAC CTC CAC ACA CAA ATT TAT CCT ACC CAA CCC AAC CCT CCA ATT 1290
Cys His Leu His Thr Glu Ile Tyr Gly Thr Gln Gly Lys Pro Ala Ile
345 350 355

CCT CAT CCA CAC CTC AAG ACC AAA AAC ATC CTT ATT AAG AAA AAT CCA 1338
Ala His Arg Asp Leu Lys Ser Lys Asn Ile Leu Ile Lys Lys Asn Gly
360 365 370

ACT TCC TCT ATT CCT CAC CTC CCC CTA CCT CTT AAA TTC AAC ACT CAT 1386
Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Lys Phe Asn Ser Asp
375 380 385 390

ACA AAT CAA CTT CAC ATA CCC TTC AAT ACC ACC CTC CCC ACC AAG CCC 1434
Thr Asn Glu Val Asp Ile Pro Leu Asn Thr Arg Val Gly Thr Lys Arg
395 400 405

TAC ATC CCT CCA CAA CTC CTC CAT CAA ACC CTC AAT AAA AAC CAT TTC 1482
Tyr Met Ala Pro Glu Val Leu Asp Glu Ser Leu Asn Lys Asn His Phe
410 415 420

CAG CCC TAC ATC ATC CCT CAC ATC TAT ACC TTT CCT TTC ATC ATT TCC 1530
Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser Phe Gly Leu Ile Ile Trp
425 430 435

CAA ATC CCT CCT CCT TCT ATT ACA CCA CCA ATC CTC CAC CAA TAT CAA 1578
Glu Met Ala Arg Arg Cys Ile Thr Gly Gly Ile Val Glu Glu Tyr Gln
440 445 450

TTA CCA TAT TAC AAC ATC CTC CCC ACT CAC CCA TCC TAT CAC CAC ATC 1626
Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp Pro Ser Tyr Glu Asp Met
455 460 465 470

CCT CAC CTT CTC TCT CTC AAA CCC TTC CCC CCA ATC CTC TCT AAC CCC 1674
Arg Glu Val Val Cys Val Lys Arg Leu Arg Pro Ile Val Ser Asn Arg
475 480 485

TCC AAC ACC CAT CAA TCT CTT CCA CCA CTT TTC AAC CTA ATC TCA GAA 1722
Trp Asn Ser Asp Glu Cys Leu Arg Ala Val Leu Lys Leu Met Ser Glu
490 495 500

TCT TCC CCC CAT AAT CCA CCC TCC ACA CTC ACA CCT TTC ACA ATC AAC 1770
Cys Trp Ala His Asn Pro Ala Ser Arg Leu Thr Ala Leu Arg Ile Lys
505 510 515

AAC ACA CTT CCA AAA ATC CTT CAA TCC CAC CAT CTA AAC ATT 1812
Lys Thr Leu Ala Lys Met Val Glu Ser Gln Asp Val Lys Ile
520 525 530

TCACAATTAA ACAATTTTCA CCCACAATTT ACACTCCAAC AACTTCTTCA CCCAACCAAT 1872
CCCTCCCATT ACCATCCAAT ACCATCTTCA CTTCCTTTCC ACACTCCTTC CTCTACATCT 1932
TCACACCCTC CTAACACTAA ACCTTACCCT ACTCTACACA ATACAACATT CCAACTTCCA 1992
ACTTCAAACA TGTCATTCTT TATATATCAC AGCTTTCTTT TAATCTGCCC TTTTTTTGTT 2052
TCCTTTTTTT CTTTTCTT 2070
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 532 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Thr Gln Leu Tyr Thr Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe
1 5 10 15

Ile Ile Ser His Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly
20 25 30

Thr Gly Met Lys Ser Asp Leu Asp Gln Lys Lys Pro Glu Asn Gly Val
35 40 45

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser
50 55 60

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly
65 70 75 80

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu
85 90 95

Thr Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp
100 105 110

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn
115 120 125

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly
130 135 140

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Val Leu Ile Ser Met
145 150 155 160

Ala Val Cys Ile Val Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr
165 170 175

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp
180 185 190

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp
195 200 205

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu
210 215 220

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val
225 230 235 240

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu
245 250 255

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe
260 265 270

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile
275 280 285

Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln
290 295 300

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe
305 310 315 320

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr
325 330 335

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr
340 345 350

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile
355 360 365

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala
370 375 380

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Ile Pro Leu Asn Thr
385 390 395 400

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser
405 410 415

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser
420 425 430

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly
435 440 445

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp
450 455 460

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg
465 470 475 480

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val
485 490 495

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu
500 505 510

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln
515 520 525

Asp Val Lys Ile
530

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2160 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mouse

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 10..1524

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:



CCCGGTTAC ATO GCG CAC TCC GCC GCA CCC TCC TCC TTC TTC CCC CTT 
M.t Al. Clu S.r Al. Gly Ala S.r S.r - J2 JE Iro 21 
45 10 

5T? 5 T ? f 70 " C CCC CCC AGC GCC CCC TCC CCC CCC CCC CCC ATC 

Val Val Leu L.u L.u Ala Gly S.r Cly Cly S.r Gly PrS JTg SJ K 

CAC CCT CTC CTC TCT CCC TCC ACC AGC TCC CTA CAC ACC AAC TAC ACC 
Gin Ala L.u L.u Cy. Ala Cy. Thr S.r Cy. L.u Gin JS JiS "r t£ 

40 4S 

TCT CAC ACA CAT CCC OCT TCC ATC CTC TCC ATC TTT AAC ere r.x-r 
Cy. Glu Thr A.p Gly Al. Cy. H.t Val sH *2 £S S £J 
80 55 60 

CTC CAG CAC CAT CTA CCT ACC TCC ATC CCC AAC CTC CAC era err r«~ 
Val Clu Hi. Hi. Val Aro Thr Cy. II. ,£ J£ X SaT K 

os 70 75 

OCT CCA AAG CCC TTC TAC TCC CTC ACT TCA CAC CAT CTC ece xxr »n» 
Ala Cly Ly. Pro Ph. Tyr Cy. L.u S.r S.r Oil 1% X Jg 
BU 85 90 



48 



96 



144 



192 



240 



288 



336 



CAC TGC TGC TAT ATT CAC TTC TCC AAC AAG ATT CAC CTC ACC crn nnn 
Hi. C y. cy. Tyr II. A.p Ph. Cy. A.n Ly. }" 2? 2S Si S 

100 205 

ACC CGA CAC CTC AAC CAC CCT CCC CAC CCC TCC ATC Tee r_/*o 
ser Cly Hi. L.u Ly. Gl« Pro Al. Hi. S X S3 Trp SJ ?S 55 

120 225 
CAC CTC CTC CCC ATC ATC CCC CCC CCC CTC TTC CTC CTC w m 
Clu Val Cly II. II. Al* Cly Pro v5 2S S5 J£ 2 X 

"° 135 140 
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480 




iH ill ?Tf SI? CTC CTC ATC AAC TXT CAC CAC CCT OTC TAC CAT 
II. II. II. V.1 Ph. Leu v.l 2i. A.n Tyr Hit Ola £g vll Tyr SI 

150 X55 

AAC CCC CAC ACC TTC CAC ATC CAC CAC CCC TCT TCC CAC ATC TCT CTC e, a 
A.n Arg Cln Arg L.u A.p M.t Clu A.p Pro 5.r ™ SS JI? gj 25 528 

" w "5 170 

TCC AAA CAC AAC ACC CTC CAC CAT CTC CTC TAG e»r 

s.r l ? . a. p ly . »» ^ jj. SJ 25 51? S£ 2S g S 12 ! " 

*" 180 



624 



720 



CCC TCT CCC TCA CCC TTA CCC CTT TTT CTC CAC CCC AC* err. iv*. «.« 
Cly S.r Cly S.r Cly L.u Pro L.u p h . 51? SS £J *g ?» g £J 

195 200 205 

ACC ATT CTT TTA CAA CAC ATT ATC CCC AAC CCC CCC TTC CCC CAX CTA 
Thr 11. vi L.u Cln Clu 21. 21. Cly Ly. Si? IE Stt Slf <?2 
"° 215 220 

TCC CCT CCT CCC TCC ACC CCT CCT CAC CTC CCT CTC AAA ATC TTC TCT 
Trp Arg Cly Arg Trp Arg Cly Cly A.p Val.Al. vS JJJ S £1 
" 3 230 235 

TCT CCT CAA CAA CCC TCT TCC TTC CCT CAA CCA CAC ATC TAC CAC Aee 

S.r Arg Clu Clu Arg S.r Trp Ph. Arg Clu Al. oX ill SS «£ ?M 

245 250 

si? s{ s s, c s . e fi is s s s; £ in s s s s 

260 265 

AAA CAT AAT CCC ACC TCC ACC CAC CTC TCC CTT CTC TCT e*r *%m 
Ly. A.p A.„ Cly Thr Trp Thr Cln L.u S£ 2 ?2 E 2J £1 SI 

* 7S 280 285 

° AC F* T ?? C Tc * CTC TTT CAT TAT CTC AAC CCC TAC ACA CTC ACC ATT o,, 
Clu ha. Cly s.r L.u Ph. A.p Tyr L.u A.n Arg Tyr Tte SI? iS x™ " 2 
2S0 «« 300 

CAC GGA ATC ATT AAC CTA CCC TTC TCT CCA CCC ACT ccr tpt /v» 
Clu Cly Met II. Ly. L.u Al. L.u S.r S K S5 £ J 2 S SI 

JW3 310 



960 



CTC CAT ATC CAC ATT CTC CCC ACT CAA CCC AAC ere cri *~ ru~m ^.^ 
L.u Hi. K.t Clu 21. V.1 Cly Thr S2 £J i£ £ !g S S SI 

325 330 

CCA CAC TTC AAC TCA AAC AAC ATC CTC CTC AAA AAA AAT CCC ATC w* 
Arg A.p Lou Ly. S.r Ly. jjj II. L«u vll Ly. Ly. A*n Cly & ?! 

CCC ATT CCA CAC CTC CCC CTC CCT CTC CCT CAT CAT CCC err &ra 
Alj lie Al. A.p Leu Cly L.u Al. Val ArJ SI a£ AU SI? «S Sp 

360 365 

ACC ATA CAC ATT CCT CCA AAT CAC ACC CTC CCC ACC AAA eel fin <...« 
Thr 21. A.p 21. Al. Pro A.n cln Arg vll Cly Thr Ly. SJ JI? 

370 37 5 380 



1008 



1056 



1104 



WO 94/11502 PCT/GB93/02367

CCT CCT CAA CTC CTT CAC GAG ACA ATC AAC ATO AAC CAC TTT GAG TCC 1200 
Ala Pro Glu Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser
385 390 395 

TTC AAA TGT GCC CAC ATC TAT OCC CTC GGG CTT CTC TAC TOG GAG ATT 1248 
Phe Lys Cys Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile
400 405 410 



CCA CCA AGA TGC AAT TCT CCA CCA CTC CAT GAA CAC TAT CAA CTC CCC 1296 
Ala Arg Arg Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro
415 420 425 

TAT TAC CAC TTA CTC CCC TCC CAC CCT TCC ATT GAG GAG ATC CCA AAC 1344 
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys
430 435 440 445 

CTT GTA TCT CAC CAC AAC CTA CCC CCC AAT CTC CCC AAC TGC TCG CAC 1392 
Val Val Cys Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln
450 455 460 

ACT TAT CAG GCC TTC CCA CTC ATC CCA AAG ATC ATC CCC GAG TCC TCC 1440 
Ser Tyr Glu Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp
465 470 475 

TAC CCC AAT CCT CCT CCC CCT CTC ACA CCT CTC CCC ATC AAC AAC ACT 1488 
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
480 485 490 

CTC TCC CAC CTA ACC CTG CAC CAA CAT CTC AAG ATT TAACCTCTTC 1534 
Leu Ser Gln Leu Ser Val Gln Glu Asp Val Lys Ile
495 500 505 



CTCTGCCTAC ACAAAGAACC TGGCCACTCA GGATGACTCC ACCCACCCTC CAACCGTCCT 1594

CCACCCCTAT CCTCTTCTTT CTCCCCCCCC CTCTCCCACA GCCCTCCCCT GCAACACCCA 1654

CACACCCTCC CAGACCOCCC CACTCCCCTT CCCTTTGACA CACACACTTT TTATATTTAC 1714

CTCCTCATCC CATCCACACC TGACCAAATC ATGTACTCAC TCAATGCCAC AACTCAAACT 1774

CCTTCACTCC GAAGTACACA CACCCAGTCC ATTCCCTCTC CACCACCGTC ACCTCCTCCC 1834

CTCGCCACCA CCCCCCCCCA TACCTTGTCC TCCACTGCCC TCCAGGTTTT CCTCCACGCA 1894

CCAGTCAACT CCCATCAAGA TATTGAGACG AACCCCAAGT TTCTCCCTCC TTCCCCTACC 1954

ACTCCTCACC CACACCATCC TTCTCATCCA CATCCCCACC ACTCCCCCTA CACACACAAC 2014

CTCCTGCCTG TCTCTCCACC CAACTCCCCA TGTCCCCACC TCTCTCCCAC ATTCTCCCTG 2074

CTCTGTCCCA CCCCCGTCTC TCTGTGTGTG TCTCTCAGTG AGTCTGTCTC TCTACACTTA 2134

ACCTCCTTCA GCTTCTGTGC ATCTCT 2160



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 505 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
1 5 10 15

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Ile Gln Ala Leu
20 25 30

Leu Cys Ala Cys Thr Ser Cys Leu Gln Thr Asn Tyr Thr Cys Glu Thr
35 40 45

Asp Gly Ala Cys Met Val Ser Ile Phe Asn Leu Asp Gly Val Glu His
50 55 60

His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
85 90 95

Tyr Ile Asp Phe Cys Asn Lys Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110

Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val Glu Leu Val
115 120 125

Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220

Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
245 250 255

Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn
465 470 475 480

Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
485 490 495

Leu Ser Val Gln Glu Asp Val Lys Ile
500 505

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1952 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO
(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mouse

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 187..1692






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

AAGCGCCGGC AGAAGTTGCC CCCGTGCTGC TCCTAGTCAG CCCCCCCAGG ACCCOCCACC 

TGGGAAGCGG CGCCCGGTTA ACTTCCCCTG AATCACAACC ATTTCCCCCT CAGCTATCAC 

AAGAGAGCAA ACAAAAAGT7 AAAGCAGCAA CCCCCCCATA AGTGAAGAGA GAAGTTTATT 

GATAAC ATG CTC TTA CCA ACC TCT CCA AAA TTA AAT CTC CGC ACC AAC 
Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys
1 5 10

AAG GAG CAT CCA GAG ACT ACA GCC CCC ACC CCT CCC CCC AAC ATC CTA 
Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Ile Leu
15 20 25 30 

CCT TCT AAA TCC CAC CAC CAC TCT CCC CAA CAC TCA CTC AAC AAT ATC 
Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile
35 40 45 

TCC AGC ACA CAT CCC TAC TCC TTC ACC ATC ATA CAA CAA CAT CAC TCT 
Cys Ser Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser
50 55 60 

CCA ATC CCT CTT CTC ACC TCT CCA TCT CTA CCA CTA CAA CCC TCA CAT 
Gly Met Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp

65 70 75

TTT CAA TCT CGT CAC ACT CCC ATT CCT CAT CAA ACA ACA TCA ATT rxx 
Phe Gln Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu

80 85 90

TCC TCC ACA CAA AGC AAT CAC TCT AAT AAA CAC CTC CAC CCC ACT CTC 
Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu
95 100 105 110

CCT CCT CTC AAG CAC AGA CAT TTT CTT CAT CCC CCC ATA CAC CAC AAC 
Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys

115 120 125

CCC TTC CTT ATC TCT CTC ACT CTC TCT ACT TTA CTC TTC CTC CTC ATT 
Ala Leu Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile
130 135 140 

ATT TTA TTC TCT TAC TTC ACC TAT AAA ACA CAA CAA CCC CCA CCT CCC 

Ile Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg
145 150 155



60 
120 
180 
228 

276 

324 

372 

420 

468 

516 

564 

612 

660 
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3S0 



708 



804 




TAC ACC ATT GOG CTC GAG CAC CAC GAG ACA TAC ATT CCT CCT CGA GAG 
Tyr Ser Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu
160 165 170

TCC CTC ACA CAC TTC ATC CAC CAC TCT CAC ACC TCC CCA ACT CCA TCA 756 
Ser Leu Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser
175 180 185 190 

GCC CTC CCT CTC CTC CTC CAA ACC ACA ATA CCT AAO CAA ATT CAC ATC 

Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met

195 200 205

CTC AAC CAC ATT CCA AAA CGC CCC TAT CGC CAC GTC TCC ATC GGA AAG 852
Val Lys Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys
210 215 220

TCC CCT CCA CAA AAC CTC CCT GTC AAA CTC TTC TTC ACC ACC CAC CAA 900 
Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu
225 230 235 

CCC ACC TCC TTC CCA CAC ACT GAG ATA TAT CAC ACC CTC CTC ATC CCC 948 
Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg
240 245 250 

CAT CAC AAT ATT CTC CCC TTC ATT CCT CCA GAT ATC AAA GCC ACT GCC 996
His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly

255 260 265 270 

TCC TCC ACT CAC TTC TAC CTC ATC ACA CAC TAT CAT CAA AAC CCC TCC 1044 
Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser
275 280 285 

CTT TAT CAC TAT CTC AAA TCC ACC ACC TTA CAC CCA AAC TCC ATC CTC 
Leu Tyr Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu
290 295 300 

AAC CTA CCC TAC TCC TCT CTC ACC CCC CTA TCC CAT TTA CAC ACC CAA 1140 
Lys Leu Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu
305 310 315

ATC TTT ACC ACT CAA CCC AAC CCA CCA ATC CCC CAT CCA CAC TTC AAA 1188
Ile Phe Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys
320 325 330 

ACT AAA AAC ATC CTC GTC AAC AAA AAT CCA ACT TCC TCC ATA CCA CAC 
Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp
335 340 345 



1092 



1236 



1284 



CTC CGC TTC CCT GTC AAG TTC ATT AGT GAC ACA AAT CAC CTT CAC ATC 
Leu Gly Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile
355 360 365

CCA CCC AAC ACC CCC CTT CCC ACC AAG CCC TAT ATC CCT CCA CAA CTC 1332
Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val
370 375 380

CTC GAC GAC ACC TTC AAT ACA AAC CAT TTC CAO TCC TAC ATT ATC CCT 1380 
Leu Asp Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala
385 390 395 
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GAC ATG TAC ACC TTT CCA CTC ATC CTC TCC CAO ATT CCA AGO ACA TCT 1428 
Asp Met Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys
400 405 410 

CTT TCT OCA CGT ATA CTC CAA CAA TAC CAC CTT CCC TAT CAC CAC CTC 1476 
Val Ser Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu
415 420 425 430 

GTC CCC ACT CAC CCT TCT TAT GAG CAC ATG AGA GAA ATT CTC TCC ATG 1524 
Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met
435 440 445 

AAC AAC TTA CCC CCT TCA TTC CCC AAT CCA TCC ACC ACT CAT GAG TCT 1572 
Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys
450 455 460 

CTC ACC CAC ATC GCG AAC CTT ATC ACA GAG TCC TCC CCC CAC AAT CCT 1620 
Leu Arg Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro
465 470 475 

CCC TCC ACC CTC ACC CCC CTC ACA CTT AAG AAA ACC CTT CCC AAA ATC 1668 
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met 
480 485 490 

TCA CAC TCC CAC CAC ATT AAA CTC TCACCTCACA TACTTCTCCA CACACCAACA 1722 
Ser Glu Ser Gln Asp Ile Lys Leu
495 500 

ATTTCACACA AGCATCCTTA CCCCAACCCT TCAACCTTAC CCTACTCCCC ACTCACTTCA 1782 

CACTTTCCTC CAACAGACCA CGCTCCCCAG ACACAGACCA ACCCAGAAAC ACCCATTCAT 1842 

CATGCCTTTC TCACCACCAG AAACTCTTTC GCTAACTTCT TCAACATATC ATCCATCTTC 1902 

CTTTCTAAGA AACCCCTCTA TTTTCAATTA CCATTTTTTT ATAAAAAAAA 1952 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys Lys Glu
1 5 10 15

Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Ile Leu Arg Cys
20 25 30

Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile Cys Ser
35 40 45

Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser Gly Met
50 55 60

Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp Phe Gln
65 70 75 80

Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu Cys Cys
85 90 95

Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro
100 105 110

Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys Ala Leu
115 120 125

Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile Ile Leu
130 135 140

Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg Tyr Ser
145 150 155 160

Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu Ser Leu
165 170 175

Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu
180 185 190

Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Lys
195 200 205

Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg
210 215 220

Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser
225 230 235 240

Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu
245 250 255

Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp
260 265 270

Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr
275 280 285

Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu
290 295 300

Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu Ile Phe
305 310 315 320

Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys
325 330 335

Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu Gly
340 345 350

Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile Pro Pro
355 360 365

Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp
370 375 380

Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala Asp Met
385 390 395 400

Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys Val Ser
405 410 415

Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu Val Pro
420 425 430

Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met Lys Lys
435 440 445

Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys Leu Arg
450 455 460

Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro Ala Ser
465 470 475 480

Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu
485 490 495

Ser Gln Asp Ile Lys Leu



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CCGCATCCTG TTCTCAAGGN AATATGTC 28 



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
GCGATCCGTC CCACTCAAAA TTTT 



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CCCGATCCCC CATATATTAA AAGCAA 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: YES



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CGGAATTCTC GTCCCATATA 



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
ATTCAAGGGC ACATCAACTT CATTTCTCTC ACTCTTC 



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGCATCCAC CATGGCGGAG TCCGCC 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACACCGGGC CGGCGATCAT 



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Gly Xaa Gly Xaa Xaa Gly

1 5 



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
Asp Phe Lys Ser Arg Asn



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Asp Leu Lys Ser Lys Asn

1 5 



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Gly Thr Lys Arg Tyr Met
1 5
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(1) GENERAL INFORMATION: 
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Franzen, Petra; Yamashita, Hidetoshi; Heldin, Carl-Henrik 

(ii) TITLE OF INVENTION: Activin Receptor-Like Kinase, Protein

Having Serine Threonine Kinase Domains And Their Use 



(iii) NUMBER OF SEQUENCES: 29 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Felfe & Lynch 

(B) STREET: 805 Third Avenue 

(C) CITY: New York City 

(D) STATE: New York 
(F) ZIP: 10022

(v) COMPUTER READABLE FORM: 
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(B) COMPUTER: IBM 

(C) OPERATING SYSTEM: PC-DOS

(D) SOFTWARE: WordPerfect
(B) FILING DATE: 28-May-1993
(B) FILING DATE: 2-July-1993




(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 9136099.2 
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(ix) TELECOMMUNICATION INFORMATION: 
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(B) TELEFAX: (212) 838-3884 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1984 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 283..1791

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CC ATG ACC TTG GGC 294 

Met Thr Leu Gly 
1 

TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG ATG GCC TTG GTG ACC CAG 342 
Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala Leu Val Thr Gln
5 10 15 20 



GGA GAC CCT GTG AAG CCG TCT CGG GGC CCG CTG GTG ACC TGC ACG TGT 390 
Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val Thr Cys Thr Cys 
25 30 35 

GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG GCC TGG TGC ACA 438 
Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly Ala Trp Cys Thr 
40 45 50 

GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG GAA CAT CGG GGC 486 
Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln Glu His Arg Gly
55 60 65 

TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC CCC ACC GAG TTC 534 
Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg Pro Thr Glu Phe 
70 75 80 

GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC CAC AAC GTG TCC 582 
Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn His Asn Val Ser 
85 90 95 100 

,qTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG CCG GGA ACA GAT 630 
Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln Pro Gly Thr Asp
105 110 115

GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC TTG CTG GCC CTG 678
Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala Leu Leu Ala Leu
120 125 130

GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA CGG AGG CAG GAG 726
Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg Arg Arg Gln Glu
135 140 145

AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC AGT CTC ATC CTG 774
Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu Ile Leu
150 155 160

AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC CTC CTG GAC AGT 822
Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser
165 170 175 180

GAC TGC ACC ACA GGG AGT GGC TCA GGG CTC CCC TTC CTG GTG CAG AGG 870 
Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gln Arg
185 190 195 

ACA GTG GCA CGG CAG GTT GCC TTG GTG GAG TGT GTG GGA AAA GGC CGC 918 

Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg
200 205 210

TAT GGC GAA GTG TGG CGG GGC TTG TGG CAC GGT GAG AGT GTG GCC GTC 966 

Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser Val Ala Val 
215 220 225 



AAG ATC TTC TCC TCG AGG GAT GAA CAG TCC TGG TTC CGG GAG ACT GAG 1014 
Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg Glu Thr Glu
230 235 240 

ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062 
Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu Gly Phe Ile
245 250 255 260 

GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTG TGG CTC ATC 1110 
Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu Trp Leu Ile
265 270 275 

ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC TTT CTG CAG AGA CAG 1158 
Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gln Arg Gln
280 285 290 

ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG TCC GCG GCA TGC 1206

Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys 
295 300 305 

GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA CAG GGC AAA CCA 1254
Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln Gly Lys Pro
310 315 320 

GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302
Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser
325 330 335 340

AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT GTG ATG CAC TCA 1350
Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Met His Ser
345 350 355

CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC CCG AGA GTG GGC ACC 1398
Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro Arg Val Gly Thr
360 365 370

AAG CGG TAC ATG GCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG GAC 1446
Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln Ile Arg Thr Asp
375 380 385 

TGC TTT GAG TCC TAC AAG TGG ACT GAC ATC TGG GCC TTT GGC CTG GTG 1494
Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe Gly Leu Val
390 395 400 

CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 1542 
Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly Ile Val Glu Asp
405 410 415 420 

TAT AGA CCA CCC TTC TAT GAT GTG GTG CCC AAT GAC CCC AGC TTT GAG 1590 
Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu 
425 430 435 



GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 1638 
Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro Thr Ile Pro
440 445 450 

AAC CGG CTG GCT GCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 1686 
Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gln Met Met
455 460 465 

CGG GAG TGC TGG TAC CCA AAC CCC TCT GCC CGA CTC ACC GCG CTG CGG 1734
Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg 
470 475 480 

ATC AAG AAG ACA CTA CAA AAA ATT AGC AAC AGT CCA GAG AAG CCT AAA 1782 
Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro Glu Lys Pro Lys
485 490 495 500 

GTG ATT CAA TAGCCCAGGA GCACCTGATT CCTTTCTGCC TGCAGGGGGC 1831 
Val Ile Gln

TGGGGGGGTG GGGGGCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT GAGTGTGGTG 1891 

TCTGCTGGGG ATGGGCAGCT GCGCCTGCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 1984



(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 503 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala
1 5 10 15

Leu Val Thr Gln Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val
20 25 30

Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly
35 40 45

Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln
50 55 60

Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg
65 70 75 80

Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn
85 90 95

His Asn Val Ser Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln
100 105 110

Pro Gly Thr Asp Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala
115 120 125

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg
130 135 140

Arg Arg Gln Glu Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser
145 150 155 160

Ser Leu Ile Leu Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp
165 170 175

Leu Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe
180 185 190

Leu Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val
195 200 205

Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu
210 215 220

Ser Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe
225 230 235 240

Arg Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile
245 250 255

Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln
260 265 270

Leu Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe
275 280 285

Leu Gln Arg Gln Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val
290 295 300

Ser Ala Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr
305 310 315 320

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val
325 330 335

Leu Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala
340 345 350

Val Met His Ser Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro
355 360 365

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln
370 375 380

Ile Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala
385 390 395 400

Phe Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly
405 410 415

Ile Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp
420 425 430

Pro Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr
435 440 445

Pro Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu
450 455 460

Ala Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu
465 470 475 480

Thr Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro
485 490 495

Glu Lys Pro Lys Val Ile Gln
500



(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2724 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 104..1630

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCCGAGTAC CCCAGTGACC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 60

GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 115

Met Val Asp Gly
1

GTG ATG ATT CTT CCT GTG CTT ATC ATG ATT GCT CTC CCC TCC CCT AGT 163
Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu Pro Ser Pro Ser
5 10 15 20 



ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATG TGT GTG 211 
Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu Tyr Met Cys Val 
25 30 35 

TGT GAA GGT CTC TCC TGC GGT AAT GAG GAC CAC TGT GAA GGC CAG CAG 259 
Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys Glu Gly Gln Gln
40 45 50 

TGC TTT TCC TCA CTG AGC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 307
Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His Val Tyr Gln Lys
55 60 65 

GGC TGC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 355 
Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr Cys Lys Thr Pro
70 75 80 

CCG TCC CCT GGC CAA GCT GTG GAG TGC TGC CAA GGG GAC TGG TGT AAC 403 
Pro S^r Pro Gly Gin Ala Val Glu Cys Cys Gin Gly Asp Trp Cys Asn 
85 90 95 100 

AGG AAC ATC ACG GCC CAG CTG CCC ACT AAA GGA AAA TCC TTC CCT GGA 451 
Slrg Asn lie Thr Ala Gin Leu Pro Thr Lys Gly Lys Ser Phe Pro Gly 
v3 105 110 115 

JlCA CAG AAT TTC CAC TTG GAG GTT GGC CTC ATT ATT CTC TCT GTA GTG 499 
Ufhr Gin Asn Phe His Leu Glu Val Gly Leu He He Leu Ser Val Val 
Q 120 125 130 

f§TC GCA GTA TGT CTT TTA GCC TGC CTG CTG GGA GTT GCT CTC CGA AAA 547 
3he Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val Ala Leu Arg Lys 
™ 135 140 145 

sjTT AAA AGG CGC AAC CAA GAA CGC CTC AAT CCC CGA GAC GTG GAG TAT 595 
[phe Lys Arg Arg Asn Gin Glu Arg Leu Asn Pro Arg Asp Val Glu Tyr 
J~ 150 155 160 

f %GC ACT ATC GAA GGG CTC ATC ACC ACC AAT GTT GGA GAC AGC ACT TTA 643 
Gly Thr He Glu Gly Leu He Thr Thr Asn Val Gly Asp Ser Thr Leu 
165 170 175 180 

GCA GAT TTA TTG GAT CAT TCG TGT ACA TCA GGA AGT GGC TCT GGT CTT 691 
Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly Ser Gly Leu 
185 190 195 

CCT TTT CTG GTA CAA AGA ACA GTG GCT CGC CAG ATT ACA CTG TTG GAG 739 
Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile Thr Leu Leu Glu 
200 205 210 

TGT GTC GGG AAA GGC AGG TAT GGT GAG GTG TGG AGG GGC AGC TGG CAA 787 
Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Gln 
215 220 225 



GGG GAA AAT GTT GCC GTG AAG ATC TTC TCC TCC CGT GAT GAG AAG TCA 835 
Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Lys Ser 
230 235 240 

TGG TTC AGG GAA ACG GAA TTG TAC AAC ACT GTG ATG CTG AGG CAT GAA 883 
Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met Leu Arg His Glu 
245 250 255 260 

AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACA TCA AGA CAC TCC AGT 931 
Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg His Ser Ser 
265 270 275 

ACC CAG CTG TGG TTA ATT ACA CAT TAT CAT GAA ATG GGA TCG TTG TAC 979 
Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met Gly Ser Leu Tyr 
280 285 290 

GAC TAT CTT CAG CTT ACT ACT CTG GAT ACA GTT AGC TGC CTT CGA ATA 1027 
Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser Cys Leu Arg Ile 
295 300 305 

GTG CTG TCC ATA GCT AGT GGT CTT GCA CAT TTG CAC ATA GAG ATA TTT 1075 
Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His Ile Glu Ile Phe 
310 315 320 

GGG ACC CAA GGG AAA CCA GCC ATT GCC CAT CGA GAT TTA AAG AGC AAA 1123 
Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys 
325 330 335 340 

AAT ATT CTG GTT AAG AAG AAT GGA CAG TGT TGC ATA GCA GAT TTG GGC 1171 
Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile Ala Asp Leu Gly 
345 350 355 

CTG GCA GTC ATG CAT TCC CAG AGC ACC AAT CAG CTT GAT GTG GGG AAC 1219 
Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu Asp Val Gly Asn 
360 365 370 

AAT CCC CGT GTG GGC ACC AAG CGC TAC ATG GCC CCC GAA GTT CTA GAT 1267 
Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp 
375 380 385 

GAA ACC ATC CAG GTG GAT TGT TTC GAT TCT TAT AAA AGG GTC GAT ATT 1315 
Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys Arg Val Asp Ile 
390 395 400 

TGG GCC TTT GGA CTT GTT TTG TGG GAA GTG GCC AGG CGG ATG GTG AGC 1363 
Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg Met Val Ser 
405 410 415 420 

AAT GGT ATA GTG GAG GAT TAC AAG CCA CCG TTC TAC GAT GTG GTT CCC 1411 
Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr Asp Val Val Pro 
425 430 435 



AAT GAC CCA AGT TTT GAA GAT ATG AGG AAG GTA GTC TGT GTG GAT CAA 1459 
Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys Val Asp Gln 
440 445 450 

CAA AGG CCA AAC ATA CCC AAC AGA TGG TTC TCA GAC CCG ACA TTA ACC 1507 
Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp Pro Thr Leu Thr 
455 460 465 

TCT CTG GCC AAG CTA ATG AAA GAA TGC TGG TAT CAA AAT CCA TCC GCA 1555 
Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln Asn Pro Ser Ala 
470 475 480 

AGA CTC ACA GCA CTG CGT ATC AAA AAG ACT TTG ACC AAA ATT GAT AAT 1603 
Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr Lys Ile Asp Asn 
485 490 495 500 

TCC CTC GAC AAA TTG AAA ACT GAC TGT TGACATTTTC ATAGTGTCAA 1650 
Ser Leu Asp Lys Leu Lys Thr Asp Cys 
505 

GAAGGAAGAT TTGACGTTGT TGTCATTGTC CAGCTGGGAC CTAATGCTGG CCTGACTGGT 1710 

TGTCAGAATG GAATCCATCT GTCTCCCTCC CCAAATGGCT GCTTTGACAA GGCAGACGTC 1770 

GTACCCAGCC ATGTGTTGGG GAGACATCAA AACCACCCTA ACCTCGCTCG ATGACTGTGA 1830 

ACTGGGCATT TCACGAACTG TTCACACTGC AGAGACTAAT GTTGGACAGA CACTGTTGCA 1890 

AAGGTAGGGA CTGGAGGAAC ACAGAGAAAT CCTAAAAGAG ATCTGGGCAT TAAGTCAGTG 1950 

GCTTTGCATA GCTTTCACAA GTCTCCTAGA CACTCCCCAC GGGAAACTCA AGGAGGTGGT 2010 

GAATTTTTAA TCAGCAATAT TGCCTGTGCT TCTCTTCTTT ATTGCACTAG GAATTCTTTG 2070 

CATTCCTTAC TTGCACTGTT ACTCTTAATT TTAAAGACCC AACTTGCCAA AATGTTGGCT 2130 

GCGTACTCCA CTGGTCTGTC TTTGGATAAT AGGAATTCAA TTTGGCAAAA CAAAATGTAA 2190 

TGTCAGACTT TGCTGCATTT TACACATGTG CTGATGTTTA CAATGATGCC GAACATTAGG 2250 

AATTGTTTAT ACACAACTTT GCAAATTATT TATTACTTGT GCACTTAGTA GTTTTTACAA 2310 

AACTGCTTTG TGCATATGTT AAAGCTTATT TTTATGTGGT CTTATGATTT TATTACAGAA 2370 

ATGTTTTTAA CACTATACTC TAAAATGGAC ATTTTCTTTT ATTATCAGTT AAAATCACAT 2430 

TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAGTT CATATGCAGA 2490 

ACGTATTTAG CCATTACCCA CGTGACACCA CCGAATATAT TATCGATTTA GAAGCAAAGA 2550 

TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC AGAATTATCC 2610 

ATTACGTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT CTACTTTTTG 2670 



TATTTAGTAG TTATTTGTAT AAATTAAATA AACTGTTTTC AAGTCAAAAA AAAA 2724 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Asp Gly Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu 
1 5 10 15 

Pro Ser Pro Ser Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu 
20 25 30 

Tyr Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys 
35 40 45 

Glu Gly Gln Gln Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His 
50 55 60 

Val Tyr Gln Lys Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr 
65 70 75 80 

Cys Lys Thr Pro Pro Ser Pro Gly Gln Ala Val Glu Cys Cys Gln Gly 
85 90 95 

Asp Trp Cys Asn Arg Asn Ile Thr Ala Gln Leu Pro Thr Lys Gly Lys 
100 105 110 

Ser Phe Pro Gly Thr Gln Asn Phe His Leu Glu Val Gly Leu Ile Ile 
115 120 125 

Leu Ser Val Val Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val 
130 135 140 

Ala Leu Arg Lys Phe Lys Arg Arg Asn Gln Glu Arg Leu Asn Pro Arg 
145 150 155 160 

Asp Val Glu Tyr Gly Thr Ile Glu Gly Leu Ile Thr Thr Asn Val Gly 
165 170 175 

Asp Ser Thr Leu Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser 
180 185 190 

Gly Ser Gly Leu Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile 
195 200 205 

Thr Leu Leu Glu Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg 
210 215 220 

Gly Ser Trp Gln Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg 
225 230 235 240 

Asp Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met 
245 250 255 

Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser 
260 265 270 

Arg His Ser Ser Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met 
275 280 285 

Gly Ser Leu Tyr Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser 
290 295 300 

Cys Leu Arg Ile Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His 
305 310 315 320 

Ile Glu Ile Phe Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp 
325 330 335 

Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile 
340 345 350 

Ala Asp Leu Gly Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu 
355 360 365 

Asp Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro 
370 375 380 

Glu Val Leu Asp Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys 
385 390 395 400 

Arg Val Asp Ile Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg 
405 410 415 

Arg Met Val Ser Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr 
420 425 430 

Asp Val Val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val 
435 440 445 

Cys Val Asp Gln Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp 
450 455 460 

Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln 
465 470 475 480 

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr 
485 490 495 

Lys Ile Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys 
500 505 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 310..1905 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTCQGCGCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA AAATATGCAT 60 

CAGTTTAATA CTGTCTTGGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTTGGAGAA 120 

AATCAGAAGT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 180 

TTGAAGTCAT TGTCAAGTGC TTGCGATCTT TTACAAGAAA ATCTCACTGA ATGATAGTCA 240 

TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAG GAAACATTAC 300 

AATTGAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTG GGA GCC 348 
Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala 
1 5 10 

TAT TTG TTC ATC ATT TCT CGT GTT CAA GGA CAG AAT CTG GAT AGT ATG 396 
Tyr Leu Phe Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met 
15 20 25 

CTT CAT GGC ACT GGG ATG AAA TCA GAC TCC GAC CAG AAA AAG TCA GAA 444 
Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu 
30 35 40 45 

AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAG TGC 492 
Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys 
50 55 60 

TAT TGC TCA GGG CAC TGT CCA GAT GAT GCT ATT AAT AAC ACA TGC ATA 540 
Tyr Cys Ser Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile 
65 70 75 

ACT AAT GGA CAT TGC TTT GCC ATC ATA GAA GAA GAT GAC CAG GGA GAA 588 
Thr Asn Gly His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu 
80 85 90 

ACC ACA TTA GCT TCA GGG TGT ATG AAA TAT GAA GGA TCT GAT TTT CAG 636 
Thr Thr Leu Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln 
95 100 105 



TGC AAA GAT TCT CCA AAA GCC CAG CTA CGC CGG ACA ATA GAA TGT TGT 684 
Cys Lys Asp Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys 
110 115 120 125 

CGG ACC AAT TTA TGT AAC CAG TAT TTG CAA CCC ACA CTG CCC CCT GTT 732 
Arg Thr Asn Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val 
130 135 140 

GTC ATA GGT CCG TTT TTT GAT GGC AGC ATT CGA TGG CTG GTT TTG CTC 780 
Val Ile Gly Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu 
145 150 155 

ATT TCT ATG GCT GTC TGC ATA ATT GCT ATG ATC ATC TTC TCC AGC TGC 828 
Ile Ser Met Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys 
160 165 170 

TTT TGT TAC AAA CAT TAT TGC AAG AGC ATC TCA AGC AGA CGT CGT TAC 876 
Phe Cys Tyr Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr 
175 180 185 

AAT CGT GAT TTG GAA CAG GAT GAA GCA TTT ATT CCA GTT GGA GAA TCA 924 
Asn Arg Asp Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser 
190 195 200 205 

CTA AAA GAC CTT ATT GAC CAG TCA CAA AGT TCT GGT AGT GGG TCT GGA 972 
Leu Lys Asp Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly 
210 215 220 

CTA CCT TTA TTG GTT CAG CGA ACT ATT GCC AAA CAG ATT CAG ATG GTC 1020 
Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val 
225 230 235 

CGG CAA GTT GGT AAA GGC CGA TAT GGA GAA GTA TGG ATG GGC AAA TGG 1068 
Arg Gln Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp 
240 245 250 

CGT GGC GAA AAA GTG GCG GTG AAA GTA TTC TTT ACC ACT GAA GAA GCC 1116 
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala 
255 260 265 

AGC TGG TTT CGA GAA ACA GAA ATC TAC CAA ACT GTG CTA ATG CGC CAT 1164 
Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His 
270 275 280 285 

GAA AAC ATA CTT GGT TTC ATA GCG GCA GAC ATT AAA GGT ACA GGT TCC 1212 
Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser 
290 295 300 

TGG ACT CAG CTC TAT TTG ATT ACT GAT TAC CAT GAA AAT GGA TCT CTC 1260 
Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu 
305 310 315 



TAT GAC TTC CTG AAA TGT GCT ACA CTG GAC ACC AGA GCC CTG CTT AAA 1308 

Tyr Asp Phe Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys 
320 325 330 

TTG GCT TAT TCA GCT GCC TGT GGT CTG TGC CAC CTG CAC ACA GAA ATT 1356 
Leu Ala Tyr Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile 
335 340 345 

TAT GGC ACC CAA GGA AAG CCC GCA ATT GCT CAT CGA GAC CTA AAG AGC 1404 
Tyr Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser 
350 355 360 365 

AAA AAC ATC CTC ATC AAG AAA AAT GGG AGT TGC TGC ATT GCT GAC CTG 1452 
Lys Asn Ile Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu 
370 375 380 

GGC CTT GCT GTT AAA TTC AAC AGT GAC ACA AAT GAA GTT GAT GTG CCC 1500 
Gly Leu Ala Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro 
385 390 395 

TTG AAT ACC AGG GTG GGC ACC AAA CGC TAC ATG GCT CCC GAA GTG CTG 1548 
Leu Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu 
400 405 410 

GAC GAA AGC CTG AAC AAA AAC CAC TTC CAG CCC TAC ATC ATG GCT GAC 1596 
Asp Glu Ser Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp 
415 420 425 

ATC TAC AGC TTC GGC CTA ATC ATT TGG GAG ATG GCT CGT CGT TGT ATC 1644 
Ile Tyr Ser Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile 
430 435 440 445 

ACA GGA GGG ATC GTG GAA GAA TAC CAA TTG CCA TAT TAC AAC ATG GTA 1692 
Thr Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val 
450 455 460 

CCG AGT GAT CCG TCA TAC GAA GAT ATG CGT GAG GTT GTG TGT GTC AAA 1740 
Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys 
465 470 475 

CGT TTG CGG CCA ATT GTG TCT AAT CGG TGG AAC AGT GAT GAA TGT CTA 1788 
Arg Leu Arg Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu 
480 485 490 

CGA GCA GTT TTG AAG CTA ATG TCA GAA TGC TGG GCC CAC AAT CCA GCC 1836 

Arg Ala Val Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala 
495 500 505 

TCC AGA CTC ACA GCA TTG AGA ATT AAG AAG ACG CTT GCC AAG ATG GTT 1884 
Ser Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val 
510 515 520 525 



GAA TCC CAA GAT GTA AAA ATC TGATGGTTAA ACCATCGGAG GAGAAACTCT 1935 
Glu Ser Gln Asp Val Lys Ile 
530 

AGACTGCAAG AACTGTTTTT ACCCATGGCA TGGGTGGAAT TAGAGTGGAA TAAGGATGTT 1995 

AACTTGGTTC TCAGACTCTT TCTTCACTAC GTGTTCACAG GCTGCTAATA TTAAACCTTT 2055 

CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT TATATATGGA 2115 

CAGCTTTATT TTAAATGTGG TTTTTGATGC CTTTTTTTAA GTGGGTTTTT ATGAACTGCA 2175 

TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT GGGTACTGAA TTGCCTGTTC 2235 

ATAAAACGGT GCTTTCTGTG AAAGCCTTAA GAAGATAAAT GAGCGCAGCA GAGATGGAGA 2295 

AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCGTTTG TATTCTACCT TTGTAAAACA 2355 

GCCTATAGAT GATGATGTGT TTGGGATACT GCTTATTTTA TGATAGTTTG TCCTGTGTCC 2415 

TTAGTGATGT GTGTGTGTCT CCATGCACAT GCACGCCGGG ATTCCTCTGC TGCCATTTGA 2475 

ATTAGAAGAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG TGGTTTTGTG 2535 

CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTGCA 2595 

AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 2655 

AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTGTGG 2715 

TTATTATTTT TGTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 2775 

ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA GCAATGTAAG 2835 

TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG AAGTTATTTA 2895 

TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala Tyr Leu Phe 
1 5 10 15 

Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly 
20 25 30 

Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu Asn Gly Val 
35 40 45 

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser 
50 55 60 

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly 
65 70 75 80 

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu 
85 90 95 

Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp 
100 105 110 

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn 
115 120 125 

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly 
130 135 140 

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu Ile Ser Met 
145 150 155 160 

Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr 
165 170 175 

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr Asn Arg Asp 
180 185 190 

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp 
195 200 205 

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val 
225 230 235 240 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 
245 250 255 

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 
260 265 270 

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile 
275 280 285 

Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln 
290 295 300 

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 
305 310 315 320 

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
325 330 335 

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr 
340 345 350 

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile 
355 360 365 

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala 
370 375 380 

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro Leu Asn Thr 
385 390 395 400 

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 415 

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser 
420 425 430 

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly 
435 440 445 

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp 
450 455 460 

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val 
485 490 495 

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu 
500 505 510 

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln 
515 520 525 

Asp Val Lys Ile 
530 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1515 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT GTT GTC CTC 48 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
1 5 10 15 

CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG GTC CAG GCT CTG 96 
Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu 
20 25 30 

CTG TGT GCG TGC ACC AGC TGC CTC CAG GCC AAC TAC ACG TGT GAG ACA 144 
Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr 
35 40 45 

GAT GGG GCC TGC ATG GTT TCC TTT TTC AAT CTG GAT GGG ATG GAG CAC 192 
Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His 
50 55 60 

CAT GTG CGC ACC TGC ATC CCC AAA GTG GAG CTG GTC CCT GCC GGG AAG 240 
His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys 
65 70 75 80 

CCC TTC TAC TGC CTG AGC TCG GAG GAC CTG CGC AAC ACC CAC TGC TGC 288 
Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys 
85 90 95 

TAC ACT GAC TAC TGC AAC AGG ATC GAC TTG AGG GTG CCC AGT GGT CAC 336 
Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His 
100 105 110 

CTC AAG GAG CCT GAG CAC CCG TCC ATG TGG GGC CCG GTG GAG CTG GTA 384 
Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

GGC ATC ATC GCC GGC CCG GTG TTC CTC CTG TTC CTC ATC ATC ATC ATT 432 
Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile 
130 135 140 



GTT TTC CTT GTC ATT AAC TAT CAT CAG CGT GTC TAT CAC AAC CGC CAG 480 
Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln 
145 150 155 160 

AGA CTG GAC ATG GAA GAT CCC TCA TGT GAG ATG TGT CTC TCC AAA GAC 528 

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp 
165 170 175 

AAG ACG CTC CAG GAT CTT GTC TAC GAT CTC TCC ACC TCA GGG TCT GGC 576 
Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

TCA GGG TTA CCC CTC TTT GTC CAG CGC ACA GTG GCC CGA ACC ATC GTT 624 
Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val 
195 200 205 

TTA CAA GAG ATT ATT GGC AAG GGT CGG TTT GGG GAA GTA TGG CGG GGC 672 
Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 

CGC TGG AGG GGT GGT GAT GTG GCT GTG AAA ATA TTC TCT TCT CGT GAA 720 
Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu 
225 230 235 240 

GAA CGG TCT TGG TTC AGG GAA GCA GAG ATA TAC CAG ACG GTC ATG CTG 768 
Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu 
245 250 255 

CGC CAT GAA AAC ATC CTT GGA TTT ATT GCT GCT GAC AAT AAA GAT AAT 816 
Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn 
260 265 270 

GGC ACC TGG ACA CAG CTG TGG CTT GTT TCT GAC TAT CAT GAG CAC GGG 864 
Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly 
275 280 285 

TCC CTG TTT GAT TAT CTG AAC CGG TAC ACA GTG ACA ATT GAG GGG ATG 912 
Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met 
290 295 300 

ATT AAG CTG GCC TTG TCT GCT GCT AGT GGG CTG GCA CAC CTG CAC ATG 960 
Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met 
305 310 315 320 

GAG ATC GTG GGC ACC CAA GGG AAG CCT GGA ATT GCT CAT CGA GAC TTA 1008 
Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu 
325 330 335 

AAG TCA AAG AAC ATT CTG GTG AAG AAA AAT GGC ATG TGT GCC ATA GCA 1056 
Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala 
340 345 350 



GAC CTG GGC CTG GCT GTC CGT CAT GAT GCA GTC ACT GAC ACC ATT GAC 1104 
Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp 
355 360 365 

ATT GCC CCG AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG GCC CCT GAA 1152 
Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 
370 375 380 

GTA CTT GAT GAA ACC ATT AAT ATG AAA CAC TTT GAC TCC TTT AAA TGT 1200 
Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys 
385 390 395 400 

GCT GAT ATT TAT GCC CTC GGG CTT GTA TAT TGG GAG ATT GCT CGA AGA 1248 
Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg 
405 410 415 

TGC AAT TCT GGA GGA GTC CAT GAA GAA TAT CAG CTG CCA TAT TAC GAC 1296 
Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp 
420 425 430 

TTA GTG CCC TCT GAC CCT TCC ATT GAG GAA ATG CGA AAG GTT GTA TGT 1344 
Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys 
435 440 445 

GAT CAG AAG CTG CGT CCC AAC ATC CCC AAC TGG TGG CAG AGT TAT GAG 1392 
Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu 
450 455 460 

GCA CTG CGG GTG ATG GGG AAG ATG ATG CGA GAG TGT TGG TAT GCC AAC 1440 
Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

GGC GCA GCC CGC CTG ACG GCC CTG CGC ATC AAG AAG ACC CTC TCC CAG 1488 
Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln 
485 490 495 

CTC AGC GTG CAG GAA GAC GTG AAG ATC TAACTGCTCC CTCTCTCCAC 1535 
Leu Ser Val Gln Glu Asp Val Lys Ile 
500 505 

ACGGAGCTCC TGGCAGCGAG AACTACGCAC AGCTGCCGCG TTGAGCGTAC GATGGAGGCC 1595 

TACCTCTCGT TTCTGCCCAG CCCTCTGTGG CCAGGAGCCC TGGCCCGCAA GAGGGACAGA 1655 

GCCCGGGAGA GACTCGCTCA CTCCCATGTT GGGTTTGAGA CAGACACCTT TTCTATTTAC 1715 

CTCCTAATGG CATGGAGACT CTGAGAGCGA ATTGTGTGGA GAACTCAGTG CCACACCTCG 1775 

AACTGGTTGT AGTGGGAAGT CCCGCGAAAC CCGGTGCATC TGGCACGTGG CCAGGAGCCA 1835 

TGACAGGGGC GCTTGGGAGG GGCCGGAGGA ACCGAGGTGT TGCCAGTGCT AAGCTGCCCT 1895 

GAGGGTTTCC TTCGGGGACC AGCCCACAGC ACACCAAGGT GGCCCGGAAG AACCAGAAGT 1955 



GCAGCCCCTC TCACAGGCAG CTCTGAGCCG CGCTTTCCCC TCCTCCCTGG GATGGACGCT 2015 

GCCGGGAGAC TGCCAGTGGA GACGGAATCT GCCGCTTTGT CTGTCCAGCC GTGTGTGCAT 2075 

GTGCCGAGGT GCGTCCCCCG TTGTGCCTGG TTCGTGCCAT GCCCTTACAC GTGCGTGTGA 2135 

GTGTGTGTGT GTGTCTGTAG GTGCGCACTT ACCTGCTTGA GCTTTCTGTG CATGTGCAGG 2195 

TCGGGGGTGT GGTCGTCATG CTGTCCGTGC TTGCTGGTGC CTCTTTTCAG TAGTGAGCAG 2255 

CATCTAGTTT CCCTGGTGCC CTTCCCTGGA GGTCTCTCCC TCCCCCAGAG CCCCTCATGC 2315 

CACAGTGGTA CTCTGTGT 2333 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
1 5 10 15 

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu 
20 25 30 

Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr 
35 40 45 

Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His 
50 55 60 

His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys 
65 70 75 80 

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys 
85 90 95 

Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His 
100 105 110 

Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile 
130 135 140 

Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln 
145 150 155 160 

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp 
165 170 175 

Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val 
195 200 205 

Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 

Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu 
225 230 235 240 

Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu 
245 250 255 

Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn 
260 265 270 

Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly 
275 280 285 

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met 
290 295 300 

Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met 
305 310 315 320 

Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu 
325 330 335 

Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala 
340 345 350 

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp 
355 360 365 

Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 
370 375 380 

Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys 
385 390 395 400 

Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg 
405 410 415 

Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp 
420 425 430 

Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys 
435 440 445 

Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu 
450 455 460 

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln 
485 490 495 

Leu Ser Val Gln Glu Asp Val Lys Ile 
500 505 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 77..1585 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGCGAGGCGA GGTTTGCTGG GGTGAGGCAG CGGCGCGGCC GGGCCGGGCC GGGCCACAGG 60 

CGGTGGCGGC GGGACC ATG GAG GCG GCG GTC GCT GCT CCG CGT CCC CGG 109 
Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg 
1 5 10 

CTG CTC CTC CTC GTG CTG GCG GCG GCG GCG GCG GCG GCG GCG GCG CTG 157 
Leu Leu Leu Leu Val Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu 
15 20 25 

CTC CCG GGG GCG ACG GCG TTA CAG TGT TTC TGC CAC CTC TGT ACA AAA 205 
Leu Pro Gly Ala Thr Ala Leu Gln Cys Phe Cys His Leu Cys Thr Lys 
30 35 40 

GAC AAT TTT ACT TGT GTG ACA GAT GGG CTC TGC TTT GTC TCT GTC ACA 253 
Asp Asn Phe Thr Cys Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr 
45 50 55 

GAG ACC ACA GAC AAA GTT ATA CAC AAC AGC ATG TGT ATA GCT GAA ATT 301 
Glu Thr Thr Asp Lys Val Ile His Asn Ser Met Cys Ile Ala Glu Ile 
60 65 70 75 



GAC TTA ATT CCT CGA GAT AGG CCG TTT GTA TGT GCA CCC TCT TCA AAA 349 
Asp Leu Ile Pro Arg Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys 
80 85 90 

ACT GGG TCT GTG ACT ACA ACA TAT TGC TGC AAT CAG GAC CAT TGC AAT 397 
Thr Gly Ser Val Thr Thr Thr Tyr Cys Cys Asn Gln Asp His Cys Asn 
95 100 105 

AAA ATA GAA CTT CCA ACT ACT GTA AAG TCA TCA CCT GGC CTT GGT CCT 445 
Lys Ile Glu Leu Pro Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro 
110 115 120 

GTG GAA CTG GCA GCT GTC ATT GCT GGA CCA GTG TGC TTC GTC TGC ATC 493 
Val Glu Leu Ala Ala Val Ile Ala Gly Pro Val Cys Phe Val Cys Ile 
125 130 135 

TCA CTC ATG TTG ATG GTC TAT ATC TGC CAC AAC CGC ACT GTC ATT CAC 541 
Ser Leu Met Leu Met Val Tyr Ile Cys His Asn Arg Thr Val Ile His 
140 145 150 155 

CAT CGA GTG CCA AAT GAA GAG GAC CCT TCA TTA GAT CGC CCT TTT ATT 589 
His Arg Val Pro Asn Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe Ile 
160 165 170 

TCA GAG GGT ACT ACG TTG AAA GAC TTA ATT TAT GAT ATG ACA ACG TCA 637 
Ser Glu Gly Thr Thr Leu Lys Asp Leu Ile Tyr Asp Met Thr Thr Ser 
175 180 185 

GGT TCT GGC TCA GGT TTA CCA TTG CTT GTT CAG AGA ACA ATT GCG AGA 685 
Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Arg 
190 195 200 

ACT ATT GTG TTA CAA GAA AGC ATT GGC AAA GGT CGA TTT GGA GAA GTT 733 
Thr Ile Val Leu Gln Glu Ser Ile Gly Lys Gly Arg Phe Gly Glu Val 
205 210 215 

TGG AGA GGA AAG TGG CGG GGA GAA GAA GTT GCT GTT AAG ATA TTC TCC 781 
Trp Arg Gly Lys Trp Arg Gly Glu Glu Val Ala Val Lys Ile Phe Ser 
220 225 230 235 

TCT AGA GAA GAA CGT TCG TGG TTC CGT GAG GCA GAG ATT TAT CAA ACT 829 
Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr 
240 245 250 

GTA ATG TTA CGT CAT GAA AAC ATC CTG GGA TTT ATA GCA GCA GAC AAT 877 
Val Met Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn 
255 260 265 

AAA GAC AAT GGT ACT TGG ACT CAG CTC TGG TTG GTG TCA GAT TAT CAT 925 
Lys Asp Asn Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His 
270 275 280 



GAG CAT GGA TCC CTT TTT GAT TAC TTA AAC AGA TAC ACA GTT ACT GTG 973 
Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val 
285 290 295 

GAA GGA ATG ATA AAA CTT GCT CTG TCC ACG GCG AGC GGT CTT GCC CAT 1021 
Glu Gly Met Ile Lys Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His 
300 305 310 315 

CTT CAC ATG GAG ATT GTT GGT ACC CAA GGA AAG CCA GCC ATT GCT CAT 1069 
Leu His Met Glu Ile Val Gly Thr Gln Gly Lys Pro Ala Ile Ala His 
320 325 330 

AGA GAT TTG AAA TCA AAG AAT ATC TTG GTA AAG AAG AAT GGA ACT TGC 1117 
Arg Asp Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys 
335 340 345 

TGT ATT GCA GAC TTA GGA CTG GCA GTA AGA CAT GAT TCA GCC ACA GAT 1165 
Cys Ile Ala Asp Leu Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp 
350 355 360 

ACC ATT GAT ATT GCT CCA AAC CAC AGA GTG GGA ACA AAA AGG TAC ATG 1213 

Thr Ile Asp Ile Ala Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met 
365 370 375 

GCC CCT GAA GTT CTC GAT GAT TCC ATA AAT ATG AAA CAT TTT GAA TCC 1261 
Ala Pro Glu Val Leu Asp Asp Ser Ile Asn Met Lys His Phe Glu Ser 
380 385 390 395 

TTC AAA CGT GCT GAC ATC TAT GCA ATG GGC TTA GTA TTC TGG GAA ATT 1309 
Phe Lys Arg Ala Asp Ile Tyr Ala Met Gly Leu Val Phe Trp Glu Ile 
400 405 410 

GCT CGA CGA TGT TCC ATT GGT GGA ATT CAT GAA GAT TAC CAA CTG CCT 1357 
Ala Arg Arg Cys Ser Ile Gly Gly Ile His Glu Asp Tyr Gln Leu Pro 
415 420 425 

TAT TAT GAT CTT GTA CCT TCT GAC CCA TCA GTT GAA GAA ATG AGA AAA 1405 
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys 
430 435 440 

GTT GTT TGT GAA CAG AAG TTA AGG CCA AAT ATC CCA AAC AGA TGG CAG 1453 
Val Val Cys Glu Gln Lys Leu Arg Pro Asn Ile Pro Asn Arg Trp Gln 
445 450 455 

AGC TGT GAA GCC TTG AGA GTA ATG GCT AAA ATT ATG AGA GAA TGT TGG 1501 
Ser Cys Glu Ala Leu Arg Val Met Ala Lys Ile Met Arg Glu Cys Trp 
460 465 470 475 

* TAT GCC AAT GGA GCA GCT AGG CTT ACA GCA TTG CGG ATT AAG AAA ACA 1549 
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr 
480 485 490 




TTA TCG CAA CTC AGT CAA CAG GAA GGC ATC AAA ATG TAATTCTACA 1595 
Leu Ser Gin Leu Ser Gin Gin Glu Gly lie Lys Met 
495 500 

GCTTTGCCTG AACTCTCCTT TTTTCTTCAG ATCTGCTCCT GGGTTTTAAT TTGGGAGGTC 1655 

AGTTGTTCTA CCTCACTGAG AGGGAACAGA AGGATATTGC TTCCTTTTGC AGCAGTGTAA 1715 

TAAAGTCAAT TAAAAACTTC CCAGGATTTC TTTGGACCCA GGAAACAGCC ATGTGGGTCC 1775 

TTTCTGTGCA CTATGAACGC TTCTTTCCCA GGACAGAAAA TGTGTAGTCT ACCTTTATTT 1835 

TTTATTAACA AAACTTGTTT TTTAAAAAGA TGATTGCTGG TCTTAACTTT AGGTAACTCT 1895 

GCTGTGCTGG AGATCATCTT TAAGGGCAAA GGAGTTGGAT TGCTGAATTA CAATGAAACA 1955 

TGTCTTATTA CTAAAGAAAG TGATTTACTC CTGGTTAGTA CATTCTCAGA GGATTCTGAA 2015 

CCACTAGAGT TTCCTTGATT CAGACTTTGA ATGTACTGTT CTATAGTTTT TCAGGATCTT 2075

AAAACTAACA CTTATAAAAC TCTTATCTTG AGTCTAAAAA TGACCTCATA TAGTAGTGAG 2135 

GAACATAATT CATGCAATTG TATTTTGTAT ACTATTATTG TTCTTTCACT TATTCAGAAC 2195

ATTACATGCC TTCAAAATGG GATTGTACTA TACCAGTAAG TGCCACTTCT GTGTCTTTCT 2255 

AATGGAAATG AGTAGAATTG CTGAAAGTCT CTATGTTAAA ACCTATAGTG TTT 2308



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 503 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg Leu Leu Leu Leu Val
1 5 10 15

Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu Leu Pro Gly Ala Thr
20 25 30

Ala Leu Gln Cys Phe Cys His Leu Cys Thr Lys Asp Asn Phe Thr Cys
35 40 45

Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr Glu Thr Thr Asp Lys
50 55 60

Val Ile His Asn Ser Met Cys Ile Ala Glu Ile Asp Leu Ile Pro Arg
65 70 75 80

Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys Thr Gly Ser Val Thr
85 90 95

Thr Thr Tyr Cys Cys Asn Gln Asp His Cys Asn Lys Ile Glu Leu Pro
100 105 110

Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro Val Glu Leu Ala Ala
115 120 125

Val Ile Ala Gly Pro Val Cys Phe Val Cys Ile Ser Leu Met Leu Met
130 135 140

Val Tyr Ile Cys His Asn Arg Thr Val Ile His His Arg Val Pro Asn
145 150 155 160

Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe Ile Ser Glu Gly Thr Thr
165 170 175

Leu Lys Asp Leu Ile Tyr Asp Met Thr Thr Ser Gly Ser Gly Ser Gly
180 185 190

Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Arg Thr Ile Val Leu Gln
195 200 205

Glu Ser Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly Lys Trp
210 215 220



Arg Gly Glu Glu Val Ala Val Lys Ile Phe Ser Ser Arg Glu Glu Arg
225 230 235 240

Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu Arg His
245 250 255

Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn Gly Thr
260 265 270

Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly Ser Leu
275 280 285



Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val Glu Gly Met Ile Lys
290 295 300

Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His Leu His Met Glu Ile
305 310 315 320

Val Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser
325 330 335

Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu
340 345 350

Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp Thr Ile Asp Ile Ala
355 360 365

Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu
370 375 380

Asp Asp Ser Ile Asn Met Lys His Phe Glu Ser Phe Lys Arg Ala Asp
385 390 395 400

Ile Tyr Ala Met Gly Leu Val Phe Trp Glu Ile Ala Arg Arg Cys Ser
405 410 415

Ile Gly Gly Ile His Glu Asp Tyr Gln Leu Pro Tyr Tyr Asp Leu Val
420 425 430

Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys Val Val Cys Glu Gln
435 440 445

Lys Leu Arg Pro Asn Ile Pro Asn Arg Trp Gln Ser Cys Glu Ala Leu
450 455 460

Arg Val Met Ala Lys Ile Met Arg Glu Cys Trp Tyr Ala Asn Gly Ala
465 470 475 480

Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln Leu Ser
485 490 495

Gln Gln Glu Gly Ile Lys Met
500



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 241..1746

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAGAGCACAG CCCTTCCCAG TCCCCGGAGC CGCCGCGCCA CGCGCGCATG ATCAAGACCT 60

TTTCCCCGGC CCCACAGGGC CTCTGGACGT GAGACCCCGG CCGCCTCCGC AAGGAGAGGC 120 

GGGGGTCGAG TCGCCCTGTC CAAAGGCCTC AATCTAAACA ATCTTGATTC CTGTTGCCGG 180 

CTGGCGGGAC CCTGAATGGC AGGAAATCTC ACCACATCTC TTCTCCTATC TCCAAGGACC 240 



ATG ACC TTG GGG AGC TTC AGA AGG GGC CTT TTG ATG CTG TCG GTG GCC 288
Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala
1 5 10 15

TTG GGC CTA ACC CAG GGG AGA CTT GCG AAG CCT TCC AAG CTG GTG AAC 336
Leu Gly Leu Thr Gln Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Asn
20 25 30

TGC ACT TGT GAG AGC CCA CAC TGC AAG AGA CCA TTC TGC CAG GGG TCA 384
Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gln Gly Ser
35 40 45

TGG TGC ACA GTG GTG CTG GTT CGA GAG CAG GGC AGG CAC CCC CAG GTC 432
Trp Cys Thr Val Val Leu Val Arg Glu Gln Gly Arg His Pro Gln Val
50 55 60

TAT CGG GGC TGT GGG AGC CTG AAC CAG GAG CTC TGC TTG GGA CGT CCC 480
Tyr Arg Gly Cys Gly Ser Leu Asn Gln Glu Leu Cys Leu Gly Arg Pro
65 70 75 80

ACG GAG TTT CTG AAC CAT CAC TGC TGC TAT AGA TCC TTC TGC AAC CAC 528
Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His
85 90 95

AAC GTG TCT CTG ATG CTG GAG GCC ACC CAA ACT CCT TCG GAG GAG CCA 576
Asn Val Ser Leu Met Leu Glu Ala Thr Gln Thr Pro Ser Glu Glu Pro
100 105 110

GAA GTT GAT GCC CAT CTG CCT CTG ATC CTG GGT CCT GTG CTG GCC TTG 624
Glu Val Asp Ala His Leu Pro Leu Ile Leu Gly Pro Val Leu Ala Leu
115 120 125

CCG GTC CTG GTG GCC CTG GGT GCT CTG GGC TTG TGG CGT GTC CGG CGG 672
Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg
130 135 140

AGG CAG GAG AAG CAG CGG GAT TTG CAC AGT GAC CTG GGC GAG TCC AGT 720
Arg Gln Glu Lys Gln Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser
145 150 155 160

CTC ATC CTG AAG GCA TCT GAA CAG GCA GAC AGC ATG TTG GGG GAC TTC 768
Leu Ile Leu Lys Ala Ser Glu Gln Ala Asp Ser Met Leu Gly Asp Phe
165 170 175

CTG GAC AGC GAC TGT ACC ACG GGC AGC GGC TCG GGG CTC CCC TTC TTG 816
Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu
180 185 190

GTG CAG AGG ACG GTA GCT CGG CAG GTT GCG CTG GTA GAG TGT GTG GGA 864
Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly
195 200 205



AAG GGC CGA TAT GGC GAG GTG TGG CGC GGT TCG TGG CAT GGC GAA AGC 912 
Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser 
210 215 220 

GTG GCG GTC AAG ATT TTC TCC TCA CGA GAT GAG CAG TCC TGG TTC CGG 960
Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg
225 230 235 240

GAG ACG GAG ATC TAC AAC ACA GTT CTG CTT AGA CAC GAC AAC ATC CTA 1008
Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu
245 250 255

GGC TTC ATC GCC TCC GAC ATG ACT TCG CGG AAC TCG AGC ACG CAG CTG 1056
Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu
260 265 270

TGG CTC ATC ACC CAC TAC CAT GAA CAC GGC TCC CTC TAT GAC TTT CTG 1104
Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu
275 280 285

CAG AGG CAG ACG CTG GAG CCC CAG TTG GCC CTG AGG CTA GCT GTG TCC 1152
Gln Arg Gln Thr Leu Glu Pro Gln Leu Ala Leu Arg Leu Ala Val Ser
290 295 300

CCG GCC TGC GGC CTG GCG CAC CTA CAT GTG GAG ATC TTT GGC ACT CAA 1200
Pro Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln
305 310 315 320

GGC AAA CCA GCC ATT GCC CAT CGT GAC CTC AAG AGT CGC AAT GTG CTG 1248
Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu
325 330 335

GTC AAG AGT AAC TTG CAG TGT TGC ATT GCA GAC CTG GGA CTG GCT GTG 1296
Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val
340 345 350

ATG CAC TCA CAA AGC AAC GAG TAC CTG GAT ATC GGC AAC ACA CCC CGA 1344
Met His Ser Gln Ser Asn Glu Tyr Leu Asp Ile Gly Asn Thr Pro Arg
355 360 365

GTG GGT ACC AAA AGA TAC ATG GCA CCC GAG GTG CTG GAT GAG CAC ATC 1392
Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His Ile
370 375 380

CGC ACA GAC TGC TTT GAG TCG TAC AAG TGG ACA GAC ATC TGG GCC TTT 1440
Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe
385 390 395 400

GGC CTA GTG CTA TGG GAG ATC GCC CGG CGG ACC ATC ATC AAT GGC ATT 1488
Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Ile Asn Gly Ile
405 410 415




GTG GAG GAT TAC AGG CCA CCT TTC TAT GAC ATG GTA CCC AAT GAC CCC 1536
Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Asn Asp Pro
420 425 430

AGT TTT GAG GAC ATG AAA AAG GTG GTG TGC GTT GAC CAG CAG ACA CCC 1584
Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro
435 440 445

ACC ATC CCT AAC CGG CTG GCT GCA GAT CCG GTC CTC TCC GGG CTG GCC 1632
Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala
450 455 460

CAG ATG ATG AGA GAG TGC TGG TAC CCC AAC CCC TCT GCT CGC CTC ACC 1680
Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr
465 470 475 480

GCA CTG CGC ATA AAG AAG ACA TTG CAG AAG CTC AGT CAC AAT CCA GAG 1728
Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Leu Ser His Asn Pro Glu
485 490 495

AAG CCC AAA GTG ATT CAC TAGCCCAGGG CCACCAGGCT TCCTCTGCCT 1776
Lys Pro Lys Val Ile His
500

J^AAGTGTGTG CTGGGGAAGA AGACATAGCC TGTCTGGGTA GAGGGAGTGA AGAGAGTGTG 1836
I^ACGCTGCCC TGTGTGTGCC TGCTCAGCTT GCTCCCAGCC CATCCAGCCA AAAATACAGC 1896
^JGAGCTGAAA TTCAAAAAAA AAAAAA 1922



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 502 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala
1 5 10 15

Leu Gly Leu Thr Gln Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Asn
20 25 30

Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gln Gly Ser
35 40 45



Trp Cys Thr Val Val Leu Val Arg Glu Gln Gly Arg His Pro Gln Val
50 55 60

Tyr Arg Gly Cys Gly Ser Leu Asn Gln Glu Leu Cys Leu Gly Arg Pro
65 70 75 80

Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His
85 90 95

Asn Val Ser Leu Met Leu Glu Ala Thr Gln Thr Pro Ser Glu Glu Pro
100 105 110

Glu Val Asp Ala His Leu Pro Leu Ile Leu Gly Pro Val Leu Ala Leu
115 120 125

Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg
130 135 140

Arg Gln Glu Lys Gln Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser
145 150 155 160

Leu Ile Leu Lys Ala Ser Glu Gln Ala Asp Ser Met Leu Gly Asp Phe
165 170 175

Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu
180 185 190

Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly
195 200 205

Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser
210 215 220

Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg
225 230 235 240

Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu
245 250 255

Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu
260 265 270

Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu
275 280 285

Gln Arg Gln Thr Leu Glu Pro Gln Leu Ala Leu Arg Leu Ala Val Ser
290 295 300

Pro Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln
305 310 315 320



Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu
325 330 335

Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val
340 345 350

Met His Ser Gln Ser Asn Glu Tyr Leu Asp Ile Gly Asn Thr Pro Arg
355 360 365

Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His Ile
370 375 380

Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe
385 390 395 400

Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Ile Asn Gly Ile
405 410 415

Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Asn Asp Pro
420 425 430

Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro
435 440 445

Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala
450 455 460

Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr
465 470 475 480

Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Leu Ser His Asn Pro Glu
485 490 495

Lys Pro Lys Val Ile His
500 
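
The running base positions printed at the right of each row of SEQ ID NO: 11 can be cross-checked against its CDS feature (LOCATION: 241..1746) and the 502-residue length stated for SEQ ID NO: 12. The following is a minimal Python sketch, not part of the listing as filed. The function names are hypothetical, and it assumes full rows of 16 codons and a CDS location that excludes the stop codon.

```python
# Illustrative sketch only; not part of the sequence listing as filed.
# Assumes 16 codons per full row and a CDS location that excludes the stop codon.


def cds_residue_count(start: int, end: int) -> int:
    """Number of codons spanned by an inclusive, 1-based CDS location."""
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError("CDS span is not a whole number of codons")
    return span // 3


def row_end_position(cds_start: int, row: int, codons_per_row: int = 16) -> int:
    """Base position printed at the end of the given 1-based full row of a CDS."""
    return cds_start - 1 + 3 * codons_per_row * row


# SEQ ID NO: 11, CDS 241..1746
print(cds_residue_count(241, 1746))  # 502
print(row_end_position(241, 14))     # 912
```

The same count applied to the other CDS features in this listing (217..1812 and 10..1524) gives 532 and 505 codons, matching the lengths stated for SEQ ID NO: 14 and SEQ ID NO: 16.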



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2070 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Mouse 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 217..1812

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTCGGAGAA ATTGGAACTA CAGTTTTATC 60
TAGCCACATC TCTGAGAATT CTGAAGAAAG CAGCAGGTGA AAGTCATTGC CAAGTGATTT 120
TGTTCTGTAA GGAAGCCTCC CTCATTCACT TACACCAGTG AGACAGCAGG ACCAGTCATT 180



CAAAGGGCCG TGTACAGGAC GCGTGGCAAT CAGACA ATG ACT CAG CTA TAC ACT 234
Met Thr Gln Leu Tyr Thr
1 5

TAC ATC AGA TTA CTG GGA GCC TGT CTG TTC ATC ATT TCT CAT GTT CAA 282
Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe Ile Ile Ser His Val Gln
10 15 20

GGG CAG AAT CTA GAT AGT ATG CTC CAT GGC ACT GGT ATG AAA TCA GAC 330
Gly Gln Asn Leu Asp Ser Met Leu His Gly Thr Gly Met Lys Ser Asp
25 30 35

TTG GAC CAG AAG AAG CCA GAA AAT GGA GTG ACT TTA GCA CCA GAG GAT 378
Leu Asp Gln Lys Lys Pro Glu Asn Gly Val Thr Leu Ala Pro Glu Asp
40 45 50

ACC TTG CCT TTC TTA AAG TGC TAT TGC TCA GGA CAC TGC CCA GAT GAT 426
Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser Gly His Cys Pro Asp Asp
55 60 65 70

GCT ATT AAT AAC ACA TGC ATA ACT AAT GGC CAT TGC TTT GCC ATT ATA 474
Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly His Cys Phe Ala Ile Ile
75 80 85

GAA GAA GAT GAT CAG GGA GAA ACC ACA TTA ACT TCT GGG TGT ATG AAG 522
Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu Thr Ser Gly Cys Met Lys
90 95 100

TAT GAA GGC TCT GAT TTT CAA TGC AAG GAT TCA CCG AAA GCC CAG CTA 570
Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp Ser Pro Lys Ala Gln Leu
105 110 115

CGC AGG ACA ATA GAA TGT TGT CGG ACC AAT TTG TGC AAC CAG TAT TTG 618
Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn Leu Cys Asn Gln Tyr Leu
120 125 130

CAG CCT ACA CTG CCC CCT GTT GTT ATA GGT CCG TTC TTT GAT GGC AGC 666
Gln Pro Thr Leu Pro Pro Val Val Ile Gly Pro Phe Phe Asp Gly Ser
135 140 145 150

ATC CGA TGG CTG GTT GTG CTC ATT TCC ATG GCT GTC TGT ATA GTT GCT 714
Ile Arg Trp Leu Val Val Leu Ile Ser Met Ala Val Cys Ile Val Ala
155 160 165

ATG ATC ATC TTC TCC AGC TGC TTT TGC TAT AAG CAT TAT TGT AAG AGT 762
Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr Lys His Tyr Cys Lys Ser
170 175 180

ATC TCA AGC AGG GGT CGT TAC AAC CGT GAT TTG GAA CAG GAT GAA GCA 810
Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp Leu Glu Gln Asp Glu Ala
185 190 195

TTT ATT CCA GTA GGA GAA TCA TTG AAA GAC CTG ATT GAC CAG TCC CAA 858
Phe Ile Pro Val Gly Glu Ser Leu Lys Asp Leu Ile Asp Gln Ser Gln
200 205 210

AGC TCT GGG AGT GGA TCT GGA TTG CCT TTA TTG GTT CAG CGA ACT ATT 906
Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile
215 220 225 230

GCC AAA CAG ATT CAG ATG GTT CGG CAG GTT GGT AAA GGC CGC TAT GGA 954
Ala Lys Gln Ile Gln Met Val Arg Gln Val Gly Lys Gly Arg Tyr Gly
235 240 245

GAA GTA TGG ATG GGT AAA TGG CGT GGT GAA AAA GTG GCT GTC AAA GTG 1002 
Glu Val Trp Met Gly Lys Trp Arg Gly Glu Lys Val Ala Val Lys Val 
250 255 260 

TTT TTT ACC ACT GAA GAA GCT AGC TGG TTT AGA GAA ACA GAA ATC TAC 1050
Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr
265 270 275

CAG ACG GTG TTA ATG CGT CAT GAA AAT ATA CTT GGT TTT ATA GCT GCA 1098
Gln Thr Val Leu Met Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala
280 285 290

GAC ATT AAA GGC ACT GGT TCC TGG ACT CAG CTG TAT TTG ATT ACT GAT 1146
Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp
295 300 305 310

TAC CAT GAA AAT GGA TCT CTC TAT GAC TTC CTG AAA TGT GCC ACA CTA 1194
Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe Leu Lys Cys Ala Thr Leu
315 320 325

GAC ACC AGA GCC CTA CTC AAG TTA GCT TAT TCT GCT GCT TGT GGT CTG 1242
Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr Ser Ala Ala Cys Gly Leu
330 335 340

TGC CAC CTC CAC ACA GAA ATT TAT GGT ACC CAA GGG AAG CCT GCA ATT 1290
Cys His Leu His Thr Glu Ile Tyr Gly Thr Gln Gly Lys Pro Ala Ile
345 350 355

GCT CAT CGA GAC CTG AAG AGC AAA AAC ATC CTT ATT AAG AAA AAT GGA 1338
Ala His Arg Asp Leu Lys Ser Lys Asn Ile Leu Ile Lys Lys Asn Gly
360 365 370

AGT TGC TGT ATT GCT GAC CTG GGC CTA GCT GTT AAA TTC AAC AGT GAT 1386
Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Lys Phe Asn Ser Asp
375 380 385 390

ACA AAT GAA GTT GAC ATA CCC TTG AAT ACC AGG GTG GGC ACC AAG CGG 1434
Thr Asn Glu Val Asp Ile Pro Leu Asn Thr Arg Val Gly Thr Lys Arg
395 400 405



TAC ATG GCT CCA GAA GTG CTG GAT GAA AGC CTG AAT AAA AAC CAT TTC 1482
Tyr Met Ala Pro Glu Val Leu Asp Glu Ser Leu Asn Lys Asn His Phe
410 415 420

CAG CCC TAC ATC ATG GCT GAC ATC TAT AGC TTT GGT TTG ATC ATT TGG 1530
Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser Phe Gly Leu Ile Ile Trp
425 430 435

GAA ATG GCT CGT CGT TGT ATT ACA GGA GGA ATC GTG GAG GAA TAT CAA 1578
Glu Met Ala Arg Arg Cys Ile Thr Gly Gly Ile Val Glu Glu Tyr Gln
440 445 450

TTA CCA TAT TAC AAC ATG GTG CCC AGT GAC CCA TCC TAT GAG GAC ATG 1626
Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp Pro Ser Tyr Glu Asp Met
455 460 465 470

CGT GAG GTT GTG TGT GTG AAA CGC TTG CGG CCA ATC GTG TCT AAC CGC 1674
Arg Glu Val Val Cys Val Lys Arg Leu Arg Pro Ile Val Ser Asn Arg
475 480 485

TGG AAC AGC GAT GAA TGT CTT CGA GCA GTT TTG AAG CTA ATG TCA GAA 1722
Trp Asn Ser Asp Glu Cys Leu Arg Ala Val Leu Lys Leu Met Ser Glu
490 495 500

TGT TGG GCC CAT AAT CCA GCC TCC AGA CTC ACA GCT TTG AGA ATC AAG 1770
Cys Trp Ala His Asn Pro Ala Ser Arg Leu Thr Ala Leu Arg Ile Lys
505 510 515

AAG ACA CTT GCA AAA ATG GTT GAA TCC CAG GAT GTA AAG ATT 1812
Lys Thr Leu Ala Lys Met Val Glu Ser Gln Asp Val Lys Ile
520 525 530



TGACAATTAA ACAATTTTGA GGGAGAATTT AGACTGCAAG AACTTCTTCA CCCAAGGAAT 1872

^JjGGTGGGATT AGCATGGAAT AGGATGTTGA CTTGGTTTCC AGACTCCTTC CTCTACATCT 1932 

lItCACAGGCTG CTAACAGTAA ACCTTACCGT ACTCTACAGA ATACAAGATT GGAACTTGGA 1992 

ACTTCAAACA TGTCATTCTT TATATATGAC AGCTTTGTTT TAATGTGGGG TTTTTTTGTT 2052 

TGCTTTTTTT GTTTTGTT 2070 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Thr Gln Leu Tyr Thr Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe
1 5 10 15

Ile Ile Ser His Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly
20 25 30

Thr Gly Met Lys Ser Asp Leu Asp Gln Lys Lys Pro Glu Asn Gly Val
35 40 45

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser
50 55 60

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly
65 70 75 80

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu
85 90 95

Thr Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp
100 105 110

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn
115 120 125

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly
130 135 140

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Val Leu Ile Ser Met
145 150 155 160

Ala Val Cys Ile Val Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr
165 170 175

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp
180 185 190

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp
195 200 205

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu
210 215 220

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val
225 230 235 240

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu
245 250 255

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe
260 265 270

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile
275 280 285



Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln
290 295 300

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe
305 310 315 320

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr
325 330 335

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr
340 345 350

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile
355 360 365

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala
370 375 380

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Ile Pro Leu Asn Thr
385 390 395 400

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser
405 410 415

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser
420 425 430

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly
435 440 445

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp
450 455 460

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg
465 470 475 480

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val
485 490 495

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu
500 505 510

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln
515 520 525

Asp Val Lys Ile
530 
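
The translation given in SEQ ID NO: 14 can be checked mechanically against the codons of the CDS in SEQ ID NO: 13. The following is a minimal Python sketch, not part of the listing as filed. It assumes the standard genetic code, and the helper names (`CODON_TABLE`, `THREE_LETTER`, `translate`) are hypothetical. The example input is the first six codons of the CDS, positions 217 to 234.

```python
# Illustrative sketch only; not part of the sequence listing as filed.
# Assumes the standard genetic code; helper names are hypothetical.

BASES = "TCAG"
# One-letter residues for all 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",  # stop codon
}


def translate(codons: str) -> list[str]:
    """Translate spaced or contiguous codons into three-letter residue names."""
    dna = "".join(codons.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, len(dna), 3)]


# First six codons of the CDS of SEQ ID NO: 13.
print(" ".join(translate("ATG ACT CAG CTA TAC ACT")))  # Met Thr Gln Leu Tyr Thr
```

A `Ter` appearing before the last codon of a CDS would indicate a misread base in the nucleotide row rather than a true stop codon.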



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 10..1524

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGCGGTTAC ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT 48
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu
1 5 10

GTT GTC CTC CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG ATC 96
Val Val Leu Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Ile
15 20 25

CAG GCT CTG CTG TGT GCG TGC ACC AGC TGC CTA CAG ACC AAC TAC ACC 144
Gln Ala Leu Leu Cys Ala Cys Thr Ser Cys Leu Gln Thr Asn Tyr Thr
30 35 40 45

TGT GAG ACA GAT GGG GCT TGC ATG GTC TCC ATC TTT AAC CTG GAT GGC 192
Cys Glu Thr Asp Gly Ala Cys Met Val Ser Ile Phe Asn Leu Asp Gly
50 55 60

GTG GAG CAC CAT GTA CGT ACC TGC ATC CCC AAG GTG GAG CTG GTT CCT 240
Val Glu His His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro
65 70 75

GCT GGA AAG CCC TTC TAC TGC CTG AGT TCA GAG GAT CTG CGC AAC ACA 288
Ala Gly Lys Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr
80 85 90

CAC TGC TGC TAT ATT GAC TTC TGC AAC AAG ATT GAC CTC AGG GTC CCC 336
His Cys Cys Tyr Ile Asp Phe Cys Asn Lys Ile Asp Leu Arg Val Pro
95 100 105

AGC GGA CAC CTC AAG GAG CCT GCG CAC CCC TCC ATG TGG GGC CCT GTG 384
Ser Gly His Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val
110 115 120 125

GAG CTG GTC GGC ATC ATC GCC GGC CCC GTC TTC CTC CTC TTC CTT ATC 432
Glu Leu Val Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile
130 135 140



ATT ATC ATC GTC TTC CTG GTC ATC AAC TAT CAC CAG CGT GTC TAC CAT 480
Ile Ile Ile Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His
145 150 155

AAC CGC CAG AGG TTG GAC ATG GAG GAC CCC TCT TGC GAG ATG TGT CTC 528
Asn Arg Gln Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu
160 165 170

TCC AAA GAC AAG ACG CTC CAG GAT CTC GTC TAC GAC CTC TCC ACG TCA 576
Ser Lys Asp Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser
175 180 185

GGG TCT GGC TCA GGG TTA CCC CTT TTT GTC CAG CGC ACA GTG GCC CGA 624
Gly Ser Gly Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg
190 195 200 205

ACC ATT GTT TTA CAA GAG ATT ATC GGC AAG GGC CGG TTC GGG GAA GTA 672
Thr Ile Val Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val
210 215 220

TGG CGT GGT CGC TGG AGG GGT GGT GAC GTG GCT GTG AAA ATC TTC TCT 720
Trp Arg Gly Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser
225 230 235

TCT CGT GAA GAA CGG TCT TGG TTC CGT GAA GCA GAG ATC TAC CAG ACC 768
Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr
240 245 250

GTC ATG CTG CGC CAT GAA AAC ATC CTT GGC TTT ATT GCT GCT GAC AAT 816
Val Met Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn
255 260 265

AAA GAT AAT GGC ACC TGG ACC CAG CTG TGG CTT GTC TCT GAC TAT CAC 864
Lys Asp Asn Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His
270 275 280 285

GAG CAT GGC TCA CTG TTT GAT TAT CTG AAC CGC TAC ACA GTG ACC ATT 912
Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile
290 295 300

GAG GGA ATG ATT AAG CTA GCC TTG TCT GCA GCC AGT GGT TTG GCA CAC 960
Glu Gly Met Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His
305 310 315

CTG CAT ATG GAG ATT GTG GGC ACT CAA GGG AAG CCG GGA ATT GCT CAT 1008
Leu His Met Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His
320 325 330

CGA GAC TTG AAG TCA AAG AAC ATC CTG GTG AAA AAA AAT GGC ATG TGT 1056
Arg Asp Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys
335 340 345

GCC ATT GCA GAC CTG GGC CTG GCT GTC CGT CAT GAT GCG GTC ACT GAC 1104
Ala Ile Ala Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp
350 355 360 365

ACC ATA GAC ATT GCT CCA AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG 1152
Thr Ile Asp Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met
370 375 380

GCT CCT GAA GTC CTT GAC GAG ACA ATC AAC ATG AAG CAC TTT GAC TCC 1200
Ala Pro Glu Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser
385 390 395

TTC AAA TGT GCC GAC ATC TAT GCC CTC GGG CTT GTC TAC TGG GAG ATT 1248
Phe Lys Cys Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile
400 405 410

GCA CGA AGA TGC AAT TCT GGA GGA GTC CAT GAA GAC TAT CAA CTG CCG 1296
Ala Arg Arg Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro
415 420 425

TAT TAC GAC TTA GTG CCC TCC GAC CCT TCC ATT GAG GAG ATG CGA AAG 1344
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys
430 435 440 445

GTT GTA TGT GAC CAG AAG CTA CGG CCC AAT GTC CCC AAC TGG TGG CAG 1392
Val Val Cys Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln
450 455 460

AGT TAT GAG GCC TTG CGA GTG ATG GGA AAG ATG ATG CGG GAG TGC TGG 1440
Ser Tyr Glu Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp
465 470 475

TAC GCC AAT GGT GCT GCC CGT CTG ACA GCT CTG CGC ATC AAG AAG ACT 1488
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
480 485 490

CTG TCC CAG CTA AGC GTG CAG GAA GAT GTG AAG ATT TAAGCTGTTC 1534
Leu Ser Gln Leu Ser Val Gln Glu Asp Val Lys Ile
495 500 505

CTCTGCCTAC ACAAAGAACC TGGGCAGTGA GGATGACTGC AGCCACCGTG CAAGCGTCGT 1594

GGAGGCCTAT CCTCTTGTTT CTGCCCGGCC CTCTGGCAGA GCCCTGGCCT GCAAGAGGGA 1654

CAGAGCCTGG GAGACGCGCG CACTCCCGTT GGGTTTGAGA CAGACACTTT TTATATTTAC 1714

CTCCTGATGG CATGGAGACC TGAGCAAATC ATGTAGTCAC TCAATGCCAC AACTCAAACT 1774

GCTTCAGTGG GAAGTACAGA GACCCAGTGC ATTGCGTGTG CAGGAGCGTG AGGTGCTGGG 1834

CTCGCCAGGA GCGGCCCCCA TACCTTGTGG TCCACTGGGC TGCAGGTTTT CCTCCAGGGA 1894

CCAGTCAACT GGCATCAAGA TATTGAGAGG AACCGGAAGT TTCTCCCTCC TTCCCGTAGC 1954

AGTCCTGAGC CACACCATCC TTCTCATGGA CATCCGGAGG ACTGCCCCTA GAGACACAAC 2014

CTGCTGCCTG TCTGTCCAGC CAAGTGCGCA TGTGCCGAGG TGTGTCCCAC ATTGTGCCTG 2074

GTCTGTGCCA CGCCCGTGTG TGTGTGTGTG TGTGTGAGTG AGTGTGTGTG TGTACACTTA 2134

ACCTGCTTGA GCTTCTGTGC ATGTGT 2160



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
1 5 10 15

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Ile Gln Ala Leu
20 25 30

Leu Cys Ala Cys Thr Ser Cys Leu Gln Thr Asn Tyr Thr Cys Glu Thr
35 40 45

Asp Gly Ala Cys Met Val Ser Ile Phe Asn Leu Asp Gly Val Glu His
50 55 60

His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
85 90 95

Tyr Ile Asp Phe Cys Asn Lys Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110

Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val Glu Leu Val
115 120 125

Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220

Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
245 250 255

Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn
465 470 475 480

Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
485 490 495

Leu Ser Val Gln Glu Asp Val Lys Ile
500 505



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1952 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mouse

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 187..1692

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

^ AAGCGGCGGC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG ACCCGGGACC 60 
J;? TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT GAGCTATGAC 120 
^AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGCCATA AGTGAAGAGA GAAGTTTATT 180 



UGATAAC ATG CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC ACC AAG 228 
Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys 
1 5 10 

□ AAG GAG GAT GGA GAG AGT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 276 
^Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys He. Leu 
15 20 25 30 

CGT TGT AAA TGC CAC CAC CAC TGT CCG GAA GAC TCA GTC AAC AAT ATC 324 
Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn He 
35 . 40 45 

TGC AGC ACA GAT GGG TAC TGC TTC ACG ATG ATA GAA GAA GAT GAC TCT 372 
Cys Ser Thr Asp Gly Tyr Cys Phe Thr Met lie Glu Glu Asp Asp Ser 
50 55 60 

GGA ATG CCT GTT GTC ACC TCT GGA TGT CTA GGA CTA GAA GGG TCA GAT 420 
9 Gly Met Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp 
65 70 75 



TTT CAA TGT CGT GAC ACT CCC ATT CCT CAT CAA AGA AGA TCA ATT GAA 468
Phe Gln Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu
80 85 90



TGC TGC ACA GAA AGG AAT GAG TGT AAT AAA GAG CTC CAC CCC ACT CTG 516 
Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu 
95 100 105 110 

CCT CCT CTC AAG GAC AGA GAT TTT GTT GAT GGG CCC ATA CAC CAC AAG 564 
Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys
115 120 125 

GCC TTG CTT ATC TCT GTG ACT GTC TGT AGT TTA CTC TTG GTC CTC ATT 612 
Ala Leu Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile
130 135 140 

ATT TTA TTC TGT TAC TTC AGG TAT AAA AGA CAA GAA GCC CGA CCT CGG 660 
Ile Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg
145 150 155 

TAC AGC ATT GGG CTG GAG CAG GAC GAG ACA TAC ATT CCT CCT GGA GAG 708 
Tyr Ser Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu
160 165 170

TCC CTG AGA GAC TTG ATC GAG CAG TCT CAG AGC TCG GGA AGT GGA TCA 756 
Ser Leu Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser
175 180 185 190 

GGC CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATG 804 
Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met
195 200 205 

GTG AAG CAG ATT GGA AAA GGC CGC TAT GGC GAG GTG TGG ATG GGA AAG 852 
Val Lys Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys
210 215 220 

TGG CGT GGA GAA AAG GTG GCT GTG AAA GTG TTC TTC ACC ACG GAG GAA 900 
Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu 
225 230 235 

GCC AGC TGG TTC CGA GAG ACT GAG ATA TAT CAG ACG GTC CTG ATG CGG 948 
Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg
240 245 250 

CAT GAG AAT ATT CTG GGG TTC ATT GCT GCA GAT ATC AAA GGG ACT GGG 996 
His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly
255 260 265 270 

TCC TGG ACT CAG TTG TAC CTC ATC ACA GAC TAT CAT GAA AAC GGC TCC 1044 
Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser
275 280 285 

CTT TAT GAC TAT CTG AAA TCC ACC ACC TTA GAC GCA AAG TCC ATG CTG 1092 
Leu Tyr Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu
290 295 300 



AAG CTA GCC TAC TCC TCT GTC AGC GGC CTA TGC CAT TTA CAC ACG GAA 1140 
Lys Leu Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu 
305 310 315 

ATC TTT AGC ACT CAA GGC AAG CCA GCA ATC GCC CAT CGA GAC TTG AAA 1188 
Ile Phe Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys
320 325 330 

AGT AAA AAC ATC CTG GTG AAG AAA AAT GGA ACT TGC TGC ATA GCA GAC 1236 
Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp
335 340 345 350

CTG GGC TTG GCT GTC AAG TTC ATT AGT GAC ACA AAT GAG GTT GAC ATC 1284
Leu Gly Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile
355 360 365 

CCA CCC AAC ACC CGG GTT GGC ACC AAG CGC TAT ATG CCT CCA GAA GTG 1332 
Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val 
370 375 380 

CTG GAC GAG AGC TTG AAT AGA AAC CAT TTC CAG TCC TAC ATT ATG GCT 1380
Leu Asp Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala
385 390 395

GAC ATG TAC AGC TTT GGA CTC ATC CTC TGG GAG ATT GCA AGG AGA TGT 1428
Asp Met Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys
400 405 410

GTT TCT GGA GGT ATA GTG GAA GAA TAC CAG CTT CCC TAT CAC GAC CTG 1476
Val Ser Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu
415 420 425 430

GTG CCC AGT GAC CCT TCT TAT GAG GAC ATG AGA GAA ATT GTG TGC ATG 1524
Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met
435 440 445

AAG AAG TTA CGG CCT TCA TTC CCC AAT CGA TGG AGC AGT GAT GAG TGT 1572
Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys 
450 455 460 

CTC AGG CAG ATG GGG AAG CTT ATG ACA GAG TGC TGG GCG CAG AAT CCT 1620 
Leu Arg Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro
465 470 475 

GCC TCC AGG CTG ACG GCC CTG AGA GTT AAG AAA ACC CTT GCC AAA ATG 1668 
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met 
480 485 490 

TCA GAG TCC CAG GAC ATT AAA CTC TGACGTCAGA TACTTGTGGA CAGAGCAAGA 1722 
Ser Glu Ser Gln Asp Ile Lys Leu
495 500 

ATTTCACAGA AGCATCGTTA GCCCAAGCCT TGAACGTTAG CCTACTGCCC AGTGAGTTCA 1782 




GACTTTCCTG GAAGAGAGCA CGGTGGGCAG ACACAGAGGA ACCCAGAAAC ACGGATTCAT 1842 
CATGGCTTTC TGAGGAGGAG AAACTGTTTG GGTAACTTGT TCAAGATATG ATGCATGTTG 1902 
CTTTCTAAGA AAGCCCTGTA TTTTGAATTA CCATTTTTTT ATAAAAAAAA 1952 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys Lys Glu
1 5 10 15

Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Ile Leu Arg Cys
20 25 30

Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile Cys Ser
35 40 45

Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser Gly Met
50 55 60

Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp Phe Gln
65 70 75 80

Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu Cys Cys
85 90 95

Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro
100 105 110

Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys Ala Leu
115 120 125

Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile Ile Leu
130 135 140

Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg Tyr Ser
145 150 155 160

Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu Ser Leu
165 170 175

Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu
180 185 190

Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Lys
195 200 205



Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg
210 215 220 



Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser 
225 230 235 240 

Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu
245 250 255

Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp
260 265 270

Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr
275 280 285 

Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu 
290 295 300 

Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu Ile Phe
305 310 315 320

Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys
325 330 335

Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu Gly
340 345 350

Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile Pro Pro
355 360 365

Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp
370 375 380

Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala Asp Met
385 390 395 400

Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys Val Ser
405 410 415

Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu Val Pro
420 425 430

Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met Lys Lys
435 440 445 

Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys Leu Arg 
450 455 460 



Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro Ala Ser
465 470 475 480 



Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu 
485 490 495 



Ser Gln Asp Ile Lys Leu
500 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



GCGGATCCTG TTGTGAAGGN AATATGTG 



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20

ftSCGATCCGTC GCAGTCAAAA TTTT 




(2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21
GCGGATCCGC GATATATTAA AAGCAA



(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CGGAATTCTG GTGCCATATA 



(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23

ATTCAAGGGC ACATCAACTT CATTTGTGTC ACTGTTG



(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GCGGATCCAC CATGGCGGAG TCGGCC 





(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACACCGGGC CGGCGATGAT 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26

Gly Xaa Gly Xaa Xaa Gly
1 5


(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27

Asp Phe Lys Ser Arg Asn 
1 5 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Asp Leu Lys Ser Lys Asn 
1 5 



(2) INFORMATION FOR SEQ ID NO: 29:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

Gly Thr Lys Arg Tyr Met 
1 5 






